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Preface 


This text provides a bridge from “sophomore” calculus to graduate courses 
which use analytic ideas (e.g., real and complex analysis, partial and ordi- 
nary differential equations, numerical analysis, fluid mechanics, and differen- 
tial geometry). For a two-semester course, the first semester should end with 
Chapter 8. For a three-quarter course, the second quarter should begin in 
Chapter 6 and end somewhere in the middle of Chapter 11. 

Our presentation is divided into two parts. The first half, Chapters 1 through 
7 together with Appendices A and B, gradually introduces the central ideas of 
analysis in a one-dimensional setting. The second half, Chapters 8 through 14 
together with Appendices C through F, covers multidimensional theory. 

More specifically, Chapter 1 introduces the real number system as a com- 
plete, ordered field; Chapters 2 through 5 cover calculus on the real line; and 
Chapters 6 and 7 discuss infinite series, including uniform and absolute conver- 
gence. Chapter 8 gives a short introduction to the algebraic structure of R”, 
including the connection between linear functions and matrices. 

At that point instructors have two options. They can cover Chapter 9 to 
explore topology and convergence in the concrete Euclidean space setting, or 
they can cover these same concepts in the abstract metric space setting (Chap- 
ter 10). Since either of these options provides the necessary foundation for the 
rest of the book, instructors are free to choose the approach they feel best suits 
their aims. 

With this background material out of the way, Chapters 11 through 13 develop 
the machinery and theory of vector calculus. Chapter 14 gives a short introduc- 
tion to Fourier series, including summability and convergence of Fourier series, 
growth of Fourier coefficients, and uniqueness of trigonometric series. 

Separating the one-dimensional from the n-dimensional material is not the 
most efficient way to present the material, but it does have two advantages. The 
more abstract, geometric concepts can be postponed until students have been 
given a thorough introduction to analysis on the real line. And, students have 
two chances to master some of the deeper ideas of analysis (e.g., convergence of 
sequences, limits of functions, and uniform continuity). 

We have made this text flexible in another way by including core material and 
enrichment material. The core material provides a foundation for the typical 
one-year course in analysis. Besides making the book a better reference, the 
enrichment material has been included for two other reasons: curious students 
can use it to delve deeper into the core material or as a jumping off place to 
pursue more general topics, and instructors can use it to supplement their course 
or to add variety from year to year. 

Enrichment and optional materials are marked with an asterisk. Exercises 
which use enrichment material are also marked with an asterisk, and the 
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material needed to solve them is cited in the Answers and Hints section. To 
make course planning easier, each enrichment section begins with a statement 
which indicates whether that section uses material from any other enrichment 
section. Since no core material depends on enrichment material, any of the 
latter can be skipped without loss in the integrity of the course. 

Most enrichment sections (5.5, 5.6, 6.5, 6.6, 7.5, 9.3, 11.6, 12.6, 14.1) are 
independent and can be covered in any order after the core material which pre- 
cedes them has been dealt with. Sections 9.8 and 12.5 require 9.3, Section 14.3 
requires 5.5 only to establish Lemma 14.24. This result can easily be proved 
for continuously differentiable functions, thereby avoiding mention of functions 
of bounded variation. In particular, the key ideas in Section 14.3 can be cov- 
ered without the background material from Section 5.5 anytime after finishing 
Chapter 7. 

Since for many students this is the last (for some the only) place to see 
a rigorous development of vector calculus, we focus our attention on classi- 
cal, nitty-gritty analysis. By avoiding abstract concepts such as vector spaces 
and the Lebesgue integral, we have room for a thorough, comprehensive 
introduction. We include sections on improper integration, the gamma func- 
tion, Lagrange multipliers, the Inverse and Implicit Function Theorem, Green’s 
Theorem, Gauss’s Theorem, Stokes’s Theorem, and a full account of the change 
of variables formula for multiple Riemann integrals. 

We assume the reader has completed a three-semester or four-quarter 
sequence in elementary calculus. Because many of our students now take their 
elementary calculus in high school (where theory may be almost nonexistent), 
we assume that the reader is familiar only with the mechanics of calculus, i.e., 
can differentiate, integrate, and graph simple functions of the form y = f(x) or 
z = f(x, y). We also assume the reader has had an introductory course in lin- 
ear algebra, i.e., can add, multiply, and take determinants of matrices with real 
entries, and are familiar with Cramer’s Rule. (Appendix C, which contains an 
exposition of all definitions and theorems from linear algebra used in the text, 
can be used as review if the instructor deems it necessary.) 

Always we emphasize the fact that the concepts and results of analysis 
are based on simple geometric considerations and on analogies with material 
already known to the student. The aim is to keep the course from looking like 
a collection of tricks and to share enough of the motivation behind the mathe- 
matics so that students are prepared to construct their own proofs when asked. 
We begin complicated proofs with a short paragraph (marked STRATEGY:) 
which shows why the proof works; for example, the Archimedean Princi- 
ple (Theorem 1.16), Density of Rationals (Theorem 1.18), Cauchy’s Theorem 
(Theorem 2.29), Change of Variables in R (Theorem 5.34), Riemann’s 
Theorem about rearrangements (Theorem 6.29), the Implicit Function Theorem 
(Theorem 11.47), the Borel Covering Lemma (Lemma 9.26), and the fact that 
a curve is smooth when ¢’ 4 0 (Remark 13.10). We precede abstruse defini- 
tions or theorems with a short paragraph which describes, in simple terms, what 
behavior we are examining, and why (e.g., Cauchy sequences, one-sided limits, 
upper and lower Riemann sums, the Integral Test, Abel’s Formula, uniform con- 
vergence, the total derivative, compact sets, differentiable curves and surfaces, 
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smooth curves, and orientation equivalence). And we include examples to show 
why each hypothesis of a major theorem is necessary (e.g., the Nested Inter- 
val Property, the Bolzano—Weierstrass Theorem, the Mean Value Theorem, the 
Heine-Borel Theorem, the Inverse Function Theorem, the existence of exact 
differentials, and Fubini’s Theorem). 

Each section contains a collection of exercises which range from very elemen- 
tary (to be sure the student understands the concepts introduced in that sec- 
tion) to more challenging (to give the student practice in using these concepts to 
expand the theory). To minimize frustration, some of the more difficult exer- 
cises have several parts which serve as an outline to a solution of the problem. 
To keep from producing students who know theory but cannot apply it, each set 
of exercises contains a mix of computational and theoretical assignments. (Exer- 
cises which play a prominent role later in the text are marked with a box. These 
exercises are an integral part of the course and all of them should be assigned.) 

Since many students have difficulty reading and understanding mathematics, 
we have paid close attention to style and organization. We have consciously 
limited the vocabulary, kept notation consistent from chapter to chapter, and 
presented the proofs in a unified style. Individual sections are determined by 
subject matter, not by length of lecture, so that students can comprehend related 
results in a larger context. Examples and important remarks are numbered 
and labeled so that students can read the text in small chunks. (Many of these, 
included for the student’s benefit, need not be covered in class.) Paragraphs are 
short and focused so that students are not overwhelmed by long-winded expla- 
nations. To help students discern between central results and peripheral ones, 
the word Theorem is used relatively sparingly; preliminary results and results 
which are only used in one section are called Remarks, Lemmas, and Examples. 
And we have broken with tradition by stating definitions explicitly with an “if 
and only if.” (How can we chide our students for using the converse of a result 
when it appears that we do so about half the time we apply a definition?) 


NEW TO THIS EDITION 


We have changed many of the computational exercises so that the answers 
are simpler and easier to obtain. We have replaced most of the beginning 
calculus-style exercises with slightly more conceptual exercises that emphasize 
the same ideas, but at a higher level. We have added many theoretical exer- 
cises of medium difficulty. We have scattered true-false questions throughout 
the first six chapters. These are designed to confront common misconceptions 
that some students tend to acquire at this level. We have gathered introduc- 
tory material that was scattered over several sections into a new section entitled 
Introduction. This section includes two accessible examples about why proof is 
necessary and why we cannot always trust what we see. We have reduced the 
number of axioms from four to three by introducing the Completeness Axiom 
first, and using it to prove the Well-Ordering Principle and the Principle of 
Mathematical Induction. We have moved Taylor’s Formula from Chapter 7 to 
Chapter 4 to offer another example of the utility of the Mean Value Theorem. 
We have given the Heine—Borel Theorem its own section and included several 
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exercises designed to give students practice in making a local condition on 
a compact set into a global one. We have reorganized Section 12.1 (Jordan 
regions) to simplify the presentation and make it easier to teach. We have 
omited Chapter 15, And we have corrected a number of misprints. 

We wish to thank Mr. P. W. Wade and Professors S. Fridli, G. S. Jordan, 
Mefharet Kocatepe, J. Long, M. E. Mays, M. S. Osborne, P. W. Schaefer, F. E. 
Schroeck, and Ali Sinan Sertoz, who carefully read parts of the first edition 
and made many valuable suggestions and corrections. Also, I wish to express 
my gratitude to Ms. C. K. Wade for several lively discussions of a pedagogi- 
cal nature, which helped shape the organization and presentation of this mate- 
rial. I wish to thank Der-Chen Chang (Georgetown University), Wen D. Chang 
(Alabama State), Patrick N. Dowling (Miami University), Jeffery Ehme 
(Spelman College), Dana S. Fine (University of Massachusetts-Dartmouth), 
Stephen Fisher (Northwestern University), Scott Fulton (Clarkson University), 
Kevin Knudson (Mississippi State University), Maria Nogin (California State 
University- Fresno), Gary Weiss (University of Cincinnati), Peter Wolfe (Uni- 
versity of Maryland), and Mohammed Yahdi (Ursinus College) who looked at 
the fourth edition while it was in manuscript form and did some pre-revision 
reviews. I wish to thank Professor Stan Perrine (Charleston Southern Uni- 
versity) for checking the penultimate version of the manuscript for accuracy 
and typographical errors. Finally, I wish to make special mention of Professor 
Lewis Lum, (Portland State University) who continues to make many careful 
and perspicuous comments about style, elegance of presentation, and level of 
rigor which have found their way into this fourth edition. 

If there remain any typographical errors, I plan to keep an up-to-date list at 
my Web site. If you find errors which are not listed at that site, please contact 
me via e-mail at iwade@utk.edu. 


William R. Wade 
Mathematics Department 


University of Tennessee 
Knoxville, TN 37996-1300 
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CHAPTER 1 


The Real Number System 


You have already had several calculus courses in which you evaluated limits, 
differentiated functions, and computed integrals. You may even remember 
some of the major results of calculus, such as the Chain Rule, the Mean Value 
Theorem, and the Fundamental Theorem of Calculus. Although you are proba- 
bly less familiar with multivariable calculus, you have taken partial derivatives, 
computed gradients, and evaluated certain line and surface integrals. 

In view of all this, you must be asking: Why another course in calculus? The 
answer to this question is twofold. Although some proofs may have been pre- 
sented in earlier courses, it is unlikely that the subtler points (e.g., completeness 
of the real numbers, uniform continuity, and uniform convergence) were cov- 
ered. Moreover, the skills you acquired were mostly computational; you were 
rarely asked to prove anything yourself. This course develops the theory of cal- 
culus carefully and rigorously from basic principles and gives you a chance to 
learn how to construct your own proofs. It also serves as an introduction to 
analysis, an important branch of mathematics which provides a foundation for 
numerical analysis, functional analysis, harmonic analysis, differential equations, 
differential geometry, real analysis, complex analysis, and many other areas of 
specialization within mathematics. 


1.1 INTRODUCTION 


Every rigorous study of mathematics begins with certain undefined concepts, 
primitive notions on which the theory is based, and certain postulates, properties 
which are assumed to be true and given no proof. Our study will be based on 
the primitive notions of real numbers and sets, which will be discussed in this 
section. 

We shall use standard notation for sets and real numbers. For example, R or 
(—oo, 00) represents the set of real numbers, % represents the empty set (the set 
with no elements), a € A means that a is an element of A, anda ¢ A means that 
ais not an element of A. We can represent a given finite set in two ways. We can 
list its elements directly, or we can describe it using sentences or equations. For 
example, the set of solutions to the equation x” = 1 can be written as 


{1} or (nea =, 


A set A is said to be a subset of a set B (notation: A C B) if and only if every 
element of A is also an element of B. If A is a subset of B but there is at least 
one element b € B that does not belong to A, we shall call A a proper subset of 
B (notation: A Cc B). Two sets A and B are said to be equal (notation: A = B) 
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if and only if A C B and B C A. If A and B are not equal, we write A # B. 
A set A is said to be nonempty if and only if A 4 @. 

The union of two sets A and B (notation: A U B) is the set of elements x such 
that x belongs to A or B or both. The intersection of two sets A and B (notation: 
AN B) is the set of elements x such that x belongs to both A and B. The com- 
plement of B relative to A (notation: A \ B, sometimes B° if A is understood) 
is the set of elements x such that x belongs to A but does not belong to B. For 
example, 


{—1, 0, 1} U {1, 2} = {-1, 0, 1, 2}, {-1, 0, 1} 9 {1, 2} = {1}, 
{1,2} \ {-1,0, 1} = {2} and {—1,0, 1} \ {1,2} = {-1, O}. 


Let X and Y be sets. The Cartesian product of X and Y is the set of ordered 
pairs defined by 


XxY:={@,y):xEX,yeY}. 


(The symbol := means “equal by definition” or “is defined to be.”) Two points 
(x,y), (z, w) € X x Y are said to be equal if and only if x = z and y= w. 

Let X and Y be sets. A relation on X x Y is any subset of X x Y. Let R bea 
relation on X x Y. The domain of FR is the collection of x € X such that (x, y) 
belongs to R for some y € Y. The range of R is the collection of y € Y such 
that (x, y) belongs to R for some x € X. When (x, y) € R, we shall frequently 
write xRy. 

A function f from X into Y (notation: f : X > Y)isarelation on X x Y whose 
domain is X (notation: Dom(f) := X) such that for each x € X there is a unique 
(one and only one) y € Y that satisfies (x, y) € f. If (x, y) € f, we shall call y 
the value of f at x (notation: y = f(x) or f : x +> y) and call x a preimage 
of y under f. We said a preimage because, in general, a point in the range 
of f might have more than one preimage. For example, since sin(kz) = 0 for 
k =0,+1, +2,..., the value 0 has infinitely many preimages under f(x) = sinx. 

If f is a function from X into Y, we will say that f is defined on X and call Y 
the codomain of f. The range of f is the collection of all values of f; that is, the 
set Ran(f) := {y € Y : f(x) = y for some x € X}. In general, then, the range 
of a function is a subset of its codomain and each y € Ran(f) has one or more 
preimages. If Ran(f) = Y and each y ¢€ Y has exactly one preimage, x € X, 
under f, then we shall say that f : X — Y has an inverse, and shall define the 
inverse function f—! : Y +> X by f~'(y) := x, where x € X satisfies f(x) = y. 

At this point it is important to notice a consequence of defining a function 
to be a set of ordered pairs. By the definition of equality of ordered pairs, two 
functions f and g are equal if and only if they have the same domain, and same 
values; that is, f,g : X — Y, and f(x) = g(x) for all x € X. If they have 
different domains, they are different functions. 

For example, let f(x) = g(x) = x*. Then f : [0, 1) — [0, 1) and g : (-1,1) > 
[0, 1) are two different functions. They both have the same range, [0, 1), but each 
y € (0, 1) has exactly one preimage under f, namely ,/y, and two preimages 
under g, namely +,/y. In particular, f has an inverse function, f lax) = Jx, 
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but g does not. Making distinctions like this will actually make our life easier 
later in the course. 

For the first half of this course, most of the concrete functions we consider 
will be real-valued functions of a real variable (i.e., functions whose domains and 
ranges are subsets of R). We shall often call such functions simply real functions. 

You are already familiar with many real functions. 


1) The exponential function e* : R — (0, oo) and its inverse function, the natu- 
ral logarithm 
* dt 
log x := —, 
{ 2£ 


defined and real-valued for each x € (0, oo). (Although this last function is 
denoted by Inx in elementary calculus texts, most analysts denote it, as we 
did just now, by log x. We will follow this practice throughout this text. For a 
more constructive definition, see Exercise 4.5.5.) 

2) The trigonometric functions (whose formulas are) represented by sin x, cos x, 
tanx, cotx, secx, and csc x, and the inverse trigonometric functions arcsin x, 
arccos x, and arctan x whose ranges are, respectively, [—2/2, 7/2], [0,7], and 
(—2/2, 7/2). 

3) The power functions x“, which can be defined constructively (see 
Appendix A.10 and Exercise 3.3.11) or by using the exponential function: 


@ 1 gtlogx x>0, aeER. 


xX 
We assume that you are familiar with the various algebraic laws and identities 
that these functions satisfy. A list of the most widely used trigonometric identi- 
ties can be found in Appendix B. The most widely used properties of the power 
functions are x° = 1 for all x 4 0; x” = x-...+x (there are n factors here) 
when n = 1,2,... andx € R; x > 0, x%- x8 = x%+8 and (x%)? = x” for all 
x > Oanda,B € R; x* = %/x when a = 1/m for some m €= 1,2,... and the 
indicated root exists and is real; and 0% := 0 for all a > 0. (The symbol 0° is left 
undefined because it is indeterminate [see Example 4.31].) 
We also assume that you can differentiate algebraic combinations of these 
functions using the basic formulas (sin x)’ = cos x, (cosx)/ = —sinx, and (e*)’ = 
e*, for x € R; (logx)! = 1/x and (x%)’ = ax%~!, for x > O anda € R; and 


-) (2n + 1)a 


(tanx)’=sec*x forx 4 5 : eZ. 


(You will have an opportunity to develop some of these rules in the exercises, 
e.g., see Exercises 4.2.9, 4.4.6, 4.5.3, 5.3.7, and 5.3.8.) Even with these assump- 
tions, we shall repeat some material from elementary calculus. 

We mentioned postulates in the opening paragraph. In the next two sections, 
we will introduce three postulates (containing a total of 13 different properties) 
which characterize the set of real numbers. Although you are probably already 
familiar with all but the last of these properties, we will use them to prove other 
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equally familiar properties (e.g., in Example 1.4 we will prove that if a 4 0, then 
a“ > 0). 

Why would we assume some properties and prove others? At one point, 
mathematicians thought that all laws about real numbers were of equal weight. 
Gradually, during the late 1800s, we discovered that many of the well-known 
laws satisfied by R are in fact consequences of others. The net result of this 
research is that the 13 properties listed below are considered to be fundamental 
properties describing R. All other laws satisfied by real numbers are secondary 
in the sense that they can be proved using these fundamental properties. 

Why would we prove a law that is well known, perhaps even “obvious”? Why 
not just assume all known properties about R and proceed from there? We 
want this book to be reasonably self-contained, because this will make it easier 
for you to begin to construct your own proofs. We want the first proofs you 
see to be easily understood, because they deal with familiar properties that are 
unobscured by new concepts. But most importantly, we want to form a habit of 
proving all statements, even seemingly “obvious” statements. 

The reason for this hard-headed approach is that some “obvious” statements 
are false. For example, divide an 8 x 8-inch square into triangles and trapezoids 
as shown on the left side of Figure 1.1. Since the 3-inch sides of the triangles 
perfectly match the 3-inch sides of the trapezoids, it is “obvious” that these tri- 
angles and trapezoids can be reassembled into a rectangle (see the right side of 
Figure 1.1). Or is it? The area of the square is 8 x 8 = 64 square inches but the 
area of the rectangle is 5 x 13 = 65 square inches. Since you cannot increase 
area by reassembling pieces, what looked right was in fact wrong. By comput- 
ing slopes, you can verify that the rising diagonal on the right side of Figure 1.1 
is, in fact, four distinct line segments that form a long narrow diamond which 
conceals that extra one square inch. 


NOTE: Reading a mathematics book is different from reading any other kind 
of book. When you see phrases like “you can verify” or “it is easy to see,” you 
should use pencil and paper to do the calculations to be sure what we’ve said is 
correct. 


Here is another example. Grab a calculator and graph the functions y = 
logx and y = '%x. It is easy to see, using calculus, that logx and '%/x are 
both increasing and concave downward on [0, co). Looking at the graphs (see 
Figure 1.2), it’s “obvious” that log x is much larger than '“/x no matter how big 
x is. Or is it? Let’s evaluate each function at e!9: log(e!°) = 1000 log e = 1000 


is much smaller than '/e!000 = ¢10 ~ 22, 000. Evidently, the graph of y = '%/x 


FIGURE 1.1 
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FIGURE 1.2 


eventually crosses that of y = logx. With a little calculus, you can prove that 
logx < 'W/x forever after that (see Exercise 4.4.6a). 

What can be learned from these examples? We cannot always trust what 
we think we see. We must, as above, find some mathematical way of testing 
our perception, either verifying that it is correct, or rejecting it as wrong. This 
type of phenomenon is not a rare occurrence. You will soon encounter several 
other plausible statements that are, in fact, false. In particular, you must harbor 
a skepticism that demands proofs of all statements not assumed in postulates, 
even the “obvious” ones. 

What, then, are you allowed to use when solving the exercises? You may use 
any property of real numbers (e.g.,2-+3 = 5, 2 < 7, or V2 is irrational) without 
reference or proof. You may use any algebraic property of real numbers involv- 
ing equal signs [e.g., (x + y)* = x7 + 2xy + y? or (x + y)(x — y) = x? — y?] 
and the techniques of calculus to find local maxima or minima of a given func- 
tion without reference or proof. After completing the exercises in Section 1.2, 
you may also use any algebraic property of real numbers involving inequalities 
(e.g.,0 < a < bimplies 0 < a* < b* for all x > 0) without reference or proof. 


1.2 ORDERED FIELD AXIOMS 


In this section we explore the algebraic structure of the real number system. We 
shall assume that the set of real numbers, R, is a field (i.e., that R satisfies the 
following postulate). 


Postulate 1. [FIELD AXIOMS]. There are functions + and -, defined on R? := 
R x R, which satisfy the following properties for every a,b,c € R: 


Closure Properties. a + b anda -b belong to R. 

Associative Properties. a+ (b +c) =(a+b)+canda-:(b-c)=(a-b)-c. 
Commutative Properties.a+b=b+aanda-b=b.-a. 

Distributive Law. a-(b+c)=a-b+a-c. 
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Existence of the Additive Identity. There is a unique element 0 € R such that 
0+a=aforallaeR. 


Existence of the Multiplicative Identity. There is a unique element 1 € R such 
that! 4Oand1-a=aforallaeR. 


Existence of Additive Inverses. For every x € R there is a unique element 
—x € Rsuch that 


x+(-x) =0. 


Existence of Multiplicative Inverses. For every x € R \ {0} there is a unique 
element x~! € R such that 


x a=. 

We note in passing that the word unique can be dropped from the statements 
in Postulate 1 (see Appendix A). 

We shall usually denote a + (—b) by a — b, a-b by ab, a! by i or 1/a, and 
a-b—' by ; or a/b. Notice that by the existence of additive and multiplicative 
inverses, the equation x + a = Ocan be solved for each a € R, and the equation 
ax = | can be solved for each a € R provided that a 4 0. 


From these few properties (i.e., from Postulate 1), we can derive all the usual 
algebraic laws of real numbers, including the following: 


(-1)7 = 1, (1) 
0-a=0, -a=(-l)-a, —(-a)=a, aeéR, (2) 
—(a—b)=b-a, a,beR, (3) 

and 
a,be€R and ab=0 imply a=0 or b=0. (4) 


We want to keep our attention sharply focused on analysis. Since the proofs 
of algebraic laws like these lie more in algebra than analysis (see Appendix A), 
we will not present them here. In fact, with the exception of the absolute value 
and the Binomial Formula, we will assume all material usually presented in a 
high school algebra course (including the quadratic formula and graphs of the 
conic sections). 

Postulate 1 is sufficient to derive all algebraic laws of R, but it does not com- 
pletely describe the real number system. The set of real numbers also has an 
order relation (i.e., a concept of “less than”). 


Postulate 2, [ORDER AXIOMS]. There is a relation < on R x R that has the 
following properties: 


Trichotomy Property. Given a, b € R, one and only one of the following state- 
ments holds: 


a<b, b<a, or a=b. 


Transitive Property. For a,b,c € R, 


a<b and b<c imply a<c. 
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The Additive Property. For a,b,c ¢ R, 
a<b and ceR imply at+c<b+e. 
The Multiplicative Properties. For a, b,c € R, 
a<b and c>0O imply ac <be 


and 


a<b and c<0O imply be <ac. 


By b > awe shall meana < b. Bya < band b > a we shall meana < b or 
a=b. Bya < b <c weshall meana < bandb < c. In particular, 2 < x < | 
makes no sense at all. 


WARNING. There are two Multiplicative Properties, so every time you multiply 
an inequality by an expression, you must carefully note the sign of that expression 
and adjust the inequality accordingly. For example, x < 1 does NOT imply that 


x? <x unless x > 0. If x < 0, then by the Second Multiplicative Property, x < 1 


implies x* > x. 


We shall call a number a € R nonnegative if a > 0 and positive if a > 0. 
Postulate 2 has a slightly simpler formulation using the set of positive elements 
as a primitive concept (see Exercise 1.2.11). We have introduced Postulate 2 as 
above because these are the properties we use most often. 

The real number system R contains certain special subsets: the set of natural 
numbers 


Ne {1256.24 


obtained by beginning with 1 and successively adding 1s to form 2 := 1+1, 3 := 
2+ 1, and so on; the set of integers 


Doe OE est 


(Zahl is German for number); the set of rationals (or fractions or quotients) 
m 
Q:= |= :m,néZandn #0}; 
n 


and the set of irrationals 
Q°=R\Q. 
Equality in Q is defined by 


i= Sfand only if mq =np. 
n q 
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Recall that each of the sets N, Z, Q, and R is a proper subset of the next; that is, 


NcZcQCR. 


! is a real 


For example, every rational is a real number (because m/n := mn— 
number by Postulate 1), but /2 is an irrational. 
Since we did not really define N and Z, we must make certain assumptions 


about them. If you are interested in the definitions and proofs, see Appendix A. 


1.1 Remark. We will assume that the sets N and Z satisfy the following 
properties. 


i) Ifn,m € Z, thenn+m, n—m, and mn belong to Z. 
ii) Ifn € Z, thenn € Nif and only ifn > 1. 
iii) There is non € Z that satisfies 0 <n <1. 


Using these properties, we can prove that Q satisfies Postulate 1 (see Exer- 
cise 1.2.9). 

We notice in passing that none of the other special subsets of R satisfies Postu- 
late 1. N satisfies all but three of the properties in Postulate 1: N has no additive 
identity (since 0 ¢ N), N has no additive inverses (e.g., —1 ¢ N), and only one 
of the nonzero elements of N (namely, 1) has a multiplicative inverse. Z sat- 
isfies all but one of the properties in Postulate 1: Only two nonzero elements 
of Z have multiplicative inverses (namely, 1 and —1). Q° satisfies all but four 
of the properties in Postulate 1: Q° does not have an additive identity (since 
0 ¢ R \ Q), does not have a multiplicative identity (since 1 ¢ R \ Q), and does 
not satisfy either closure property. Indeed, since V2 is irrational, the sum of 
irrationals may be rational (./2 + (—V/2) = 0) and the product of irrationals may 
be rational (./2- /2 = 2). 

Notice that any subset of R satisfies Postulate 2. Thus Q satisfies both Pos- 
tulates 1 and 2. The remaining postulate, introduced in Section 1.3, identifies a 
property that Q does not possess. In particular, Postulates 1 through 3 distin- 
guish R from each of its special subsets N, Z, Q, and Q°. These postulates actu- 
ally characterize R; that is, R is the only set that satisfies Postulates 1 through 3. 
(Such a set is called a complete Archimedean ordered field. We may as well 
admit a certain arbitrariness in choosing this approach. R has been developed 
axiomatically in at least five other ways [e.g., as a one-dimensional continuum 
or as a Set of binary decimals with certain arithmetic operations]. The decision 
to present R using Postulates 1 through 3 is based partly on economy and partly 
on personal taste.) 

Postulates 1 and 2 can be used to derive all identities and inequalities which 
are true for real numbers [e.g., see implications (5) through (9) below]. Since 
arguments based on inequalities are of fundamental importance to analysis, we 
begin to supply details of proofs at this stage. 

What is a proof? Every mathematical result (for us this includes examples, 
remarks, lemmas, and theorems) has hypotheses and a conclusion. There are 
three main methods of proof: mathematical induction, direct deduction, and 
contradiction. 
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Mathematical induction, a special method for proving statements that depend 
on positive integers, will be covered in Section 1.4. 

To construct a deductive proof, we assume the hypotheses to be true and pro- 
ceed step by step to the conclusion. Each step is justified by a hypothesis, a 
definition, a postulate, or a mathematical result that has already been proved. 
(Actually, this is usually the way we write a proof. When constructing your own 
proofs, you may find it helpful to work forward from the hypotheses as far as 
you can and then work backward from the conclusion, trying to meet in the 
middle.) 

To construct a proof by contradiction, we assume the hypotheses to be true, 
the conclusion to be false, and work step by step deductively until a contra- 
diction occurs; that is, a statement that is obviously false or that is contrary to 
the assumptions made. At this point the proof by contradiction is complete. The 
phrase “suppose to the contrary” always indicates a proof by contradiction (e.g., 
see the proof of Theorem 1.9). 

What about false statements? How do we “prove” that a statement is false? 
We can show that a statement is false by producing a single, concrete example 
(called a counterexample) that satisfies the hypotheses but not the conclusion 
of that statement. For example, to show that the statement “x > 1 implies 
x? — x —2 40” is false, we need only observe that x = 2 is greater than 1 but 
27-2-2=0. 

Here are some examples of deductive proofs. (Note: The symbol Ml indicates 
that the proof or solution is complete.) 


1.2 EXAMPLE. 
Ifa € R, prove that 


a#0 implies a* >0. (5) 


In particular, —1 <0 <1. 


Proof. Suppose that a 4 0. By the Trichotomy Property, either a > 0 or 
a <0. 

Case 1. a > 0. Multiply both sides of this inequality by a, using the First 
Multiplicative Property. We obtain a* = a-a > 0-a. Since (by (2)),0-a =0 
we conclude that a* > 0. 

Case 2. a < 0. Multiply both sides of this inequality by a. Since a < Q, it 
follows from the Second Multiplicative Property that a2 = a-a >0-a=0. 
This proves that a? > 0 whena # 0. 

Since 1 ¥ 0, it follows that 1 = 17 > 0. Adding —1 to both sides of this 
inequality, we conclude thatO = 1—1>0-1=-1. a 


13 EXAMPLE. 
Ifa € R, prove that 


0<a<1 implies 0<a*<a and a>1 implies a*>a. (6) 
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Proof. Suppose that 0 < a < 1. Multiply both sides of this inequality by a 
using the First Multiplicative Property. We obtain 0 = 0-a < a* <1-a=a. 
In particular, 0 < a? <a. 

On the other hand, if a > 1, then a > 0 by Example 1.2 and the Transitive 
Property. Multiplying a > 1 by a, we conclude thata? =a-a>1-a=a. H 


Similarly (see Exercise 1.2.2), we can prove that 


O<a<b and O0<c<d imply ac <bd, (7) 
O0<a<b implies 0<a?<b* and 0</Ja< Vb, (8) 
and 
F : 1 1 
0<a<b_ implies Bo (9) 
a 


Much of analysis deals with estimation (of error, of growth, of volume, etc.) 
in which these inequalities and the following concept play a central role. 


1.4 Definition. 


The absolute value of a number a € R is the number 


a a>0 
la] = 
—a a<0. 


When proving results about the absolute value, we can always break the proof 
up into several cases, depending on when the parameters are positive, negative, 
or zero. Here is a typical example. 


1.5 Remark. The absolute value is multiplicative; that is, |ab| = |a| \b| for all 
a,beR. 


Proof. We consider four cases. 

Case 1.a = 0 or b = 0. Then ab = 0, so by definition, |ab| = 0 = |a| |b]. 

Case 2.a > Oandb > 0. By the First Multiplicative Property, ab > 0-b = 0. 
Hence by definition, |ab| = ab = |a| |b]. 

Case 3. a > Oandb < 0, or, b > Oanda < 0. By symmetry, we may 
suppose that a > 0 and b < 0. (That is, if we can prove it fora > 0 and b < 0, 
then by reversing the roles of a and b, we can prove it fora < 0 andb > 0.) 
By the Second Multiplicative Property, ab < 0. Hence by Definition 1.4, (2), 
associativity, and commutativity, 


|ab| = —(ab) = (—1)(ab) = a((-1)b) = a(—b) = Ia] |DI. 
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Case 4. a < Oand b < 0. By the Second Multiplicative Property, ab > 0. 
Hence by Definition 1.4, 


|ab| = ab = (—1)?(ab) = (—a)(—b) = |a| |}. = 


We shall soon see that there are more efficient ways to prove results about 
absolute values than breaking the argument into cases. 

The following result is useful when solving inequalities involving absolute 
value signs. 


1.6 Theorem. [FUNDAMENTAL THEOREM OF ABSOLUTE VALUES]. 
Leta € Rand M > 0. Then |a| < M if and only if -M <a<M. 


Proof. Suppose first that |a| <M. Multiplying by -1, we also have —|a|>—M. 
Case 1. a => 0. By Definition 1.4, |a| = a. Thus by hypothesis, 


—-M <0<a=l|a|<M. 
Case 2. a < 0. By Definition 1.4, |a| = —a. Thus by hypothesis, 
—-M <-|a|=a<0<M. 
This proves that —M <a < M ineither case. 
Conversely, if -M <a < M,thena < M and —M <a. Multiplying the 


second inequality by —1, we have —a < M. Consequently, |a| = a < M if 
a > 0, and |a| = —a < Mifa <0. 


NOTE: Ina similar way we can prove that |a| < M if and only if -M <a < M. 


Here is another useful result about absolute values. 


1.7 Theorem. The absolute value satisfies the following three properties. 

i) [Positive DEFINITE] For alla € R, |a| => 0 with |a| = 0 if and only if a = 0. 
ii) [Symmetric] For alla,b €R, |a —b| = |b—al. 
iii) [TRIANGLE INEQUALITIES] For all a,b € R, 


la+b| <|a|+|b| and | \a|—|b| |< |a—5l. 


Proof. i) Ifa > 0, then |a| = a > 0. Ifa < 0, then by Definition 1.4 and the 


Second Multiplicative Property, |a| = —a = (—l)a > 0. Thus |a| > 0 for all 
aeéR. 
If |a| = 0, then by definition a = |a| = 0 whena > O anda = —|a| = 0 


when a < 0. Thus |a| = 0 implies that a = 0. Conversely, |0| = 0 by definition. 
ii) By Remark 1.5, |a —b| =|—1| |b-—a| = |b—-a|. 
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iti) To prove the first inequality, notice that |x| < |x| holds for any x € R. 
Thus Theorem 1.6 implies that —|a| < a < |a| and —|b| < b < |b|. Adding 
these inequalities (see Exercise 1.2.1), we obtain 


—(la| + |b|) <a+b < |a| +0. 


Hence by Theorem 1.6 again, |a + b| < |a| + |b]. 
To prove the second inequality, apply the first inequality to (a — b) +b. We 
obtain 


|a| — |b] = la —b+ | — |b] < |a —b| + |b] — |b] = |a — OD]. 
By reversing the roles of a and b and applying part ii), we also obtain 
|b| — la] < |b—al| = |a — Dl. 


Multiplying this last inequality by —1 and combining it with the preceding one 
verifies 
—la —b| < |a| — |b] < la — Ol. 


We conclude by Theorem 1.6 that | ja| — |D| | <la—D|. | 


Notice once and for all that this last inequality implies that |a| — |b| < |a — D| 
for alla, b € R. We will use this inequality several times. 


WARNING. Some students mistakenly mix absolute values and the Additive 
Property to conclude that b < c implies |a + b| < |a+c|. It is important from the 
beginning to recognize that this implication is false unless both a + b anda +c 
are nonnegative. For example, ifa = 1, b = —5, andc = —1, then b < c but 
la + b| = 4 is not less than |a + c| = 0. 


A correct way to estimate using absolute value signs usually involves one of 
the triangle inequalities. 


18 EXAMPLE. 
Prove that if —2 < x < 1, then |x? —x| <6. 
Proof. By hypothesis, |x| < 2. Hence by the triangle inequality and 
Remark 1.5, 
|x? — x] < |x|? + |x] <442=6. | 


The following result (which is equivalent to the Trichotomy Property) will be 
used many times in this and subsequent chapters. 


1.9 Theorem. Letx,y,aéR. 
i) x <y+e foralle > Oif and only if x < y. 


ii) x > y—e foralle > Oif and only if x > y. 
iti) |a| < e foralle > Oifand only ifa =0. 
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Proof. i) Suppose to the contrary that x < y+e for alle > 0 butx > y. 
Set e¢g = x — y > O and observe that y + e¢9 = x. Hence by the Trichotomy 
Property, y + €9 cannot be greater than x. This contradicts the hypothesis for 
€ = €9. Thus x < y. 

Conversely, suppose that x < y and e > Ois given. Either x < yorx = y. 
Ifx < y,thenx +0 < y+0 < y+e by the Additive and Transitive Properties. 
Ifx = y, then x < y+e by the Additive Property. Thus x < y+e foralle > 0 
in either case. This completes the proof of part i). 

ii) Suppose that x > y —e for alle > 0. By the Second Multiplicative 
Property, this is equivalent to —x < —y + ¢, hence by part i), equivalent to 
—x <—y. By the Second Multiplicative Property, this is equivalent to x > y. 

iti) Suppose that |a| < « = 0+e foralle > 0. By parti), this is equivalent to 
|a| < 0. Since it is always the case that |a| > 0, we conclude by the Trichotomy 
Property that |a| = 0. Therefore, a = 0 by Theorem 1.71. a 


Let a and b be real numbers. A closed interval is a set of the form 


[a,b] := {x €R:a<x <D}, [a,oo) := {x ER: a <x}, 
(—oo, b] := {x ER: x <b}, or (—~w,ocw):=R, 


and an open interval is a set of the form 


(a,b):={xER:a<x<b}, (4,w):={x Ee R:a < x}, 
(—oco, b) := {x ER: x <b}, or (-—~w,w):=R. 


By an interval we mean a closed interval, an open interval, or a set of the form 
[a,b):={x €R:a<x<b} or (a,bl:={x Ee R:a<x <)}. 


Notice, then, that when a < b, the intervals [a,b], [a, b), (a, b], and (a, b) cor- 
respond to line segments on the real line, but when b < a, these “intervals” are 
all the empty set. 

An interval J is said to be bounded if and only if it has the form [a, b], (a, b), 
[a, b), or (a, b] for some —co < a < b < ow, in which case the numbers a,b 
will be called the endpoints of J. All other intervals will be called unbounded. 
An interval with endpoints a, b is called degenerate if a = b and nondegenerate 
if a < b. Thus a degenerate open interval is the empty set, and a degenerate 
closed interval is a point. 

Analysis has a strong geometric flavor. Geometry enters the picture because 
the real number system can be identified with the real line in such a way that 
a < b if and only if a lies to the left of b (see Figures 1.2, 2.1, and 2.2). This 
gives us a way of translating analytic results on R into geometric results on the 
number line, and vice versa. We close with several examples. 

The absolute value is closely linked to the idea of length. The length of a 
bounded interval J with endpoints a, b is defined to be |/| := |b — al, and the 
distance between any two points a, b € R is defined by |a — b|. 
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Inequalities can be interpreted as statements about intervals. By Theorem 1.6, 
la| < M if and only if a belongs to the closed interval [—M, M]; and by Theo- 
rem 1.9, a belongs to the open interval (—e, ¢) for all e > Oif and only ifa = 0. 

We will use this point of view in Chapters 2 through 5 to give geomet- 
ric interpretations to the calculus of functions defined on R, and in Chap- 
ters 11 through 13 to extend this calculus to functions defined on the Euclidean 
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spaces R”. 


EXERCISES 


In each of the following exercises, verify the given statement carefully, proceeding 
step by step. Validate each step that involves an inequality by using some statement 


found in this section. 


1.2.0 Let a, b,c,d € Rand consider each of the following statements. Decide 
which are true and which are false. Prove the true ones and give coun- 


1.2.1. 


1.2.2. 


1.2.3. 


1.2.4. 


terexamples to the false ones. 

a) Ifa <bandc <d <0, thenac > bd. 

b) Ifa < bandc > 1, then ja+c| < |b+c|. 
c) Ifa <bandb <a-+c, then ja —b| <c. 
d) Ifa < b—e foralle > 0, thena < 0. 
Suppose that a,b,c € Randa <b. 


a) Prove thata+c<b+e. 
b) Ifc > 0, prove thata-c<b-c. 


Prove (7), (8), and (9). Show that each of these statements is false if the 


hypothesis a > 0 or a > Ois removed. 


This exercise is used in Section 6.3. The positive part of ana € Ris 


defined by 
a eae 
2 
and the negative part by 
_ |al—a 
cae ac 


a) Prove thata =at —a™ and |a|=at+a-. 
b) Prove that 


+ a a>0 = 0 a>0 
at= and a = 
0) a<0O —a a<0. 


Solve each of the following inequalities for x € R. 


a) |4x —2| < 22 
b) |1—2x| <7 


1.2.5. 


1.2.6. 


1.2.7. 


1.2.8. 


1.2.9. 
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Cc) jx? — x] < x3 
2 

d) . <4 
x—2 

) 3x2 

e) —~—— < 
3x2 —3 

Leta, beR. 


a) Prove thatifa > 2andb=1+/a-—1,then2 <b <a. 

b) Prove that if2 <a <3andb=2+4 Va —2, then0 <a <b. 
c) Prove that if0 <a <landb=1--J/1-—a,then0 <b <a. 
d) Prove that if3 <a <5andb=2+4 Va —2, then3 <b <a. 


The arithmetic mean of a, b € Ris A(a, b) = (a+b)/2, and the geometric 
mean of a,b € [0,00) is G(a,b) = Vab. If 0 < a < b, prove that 
a < G(a,b) < A(a,b) < b. Prove that G(a, b) = A(a, b) if and only if 
a=b. 

Letx ER. 

a) Prove that |x| < 4 implies Fag _ 1| <5 |x —- 1]. 

b) Prove that |x| < 1 implies [see + 5x - 6| <7\|x — 1). 

c) Prove that —3 < x < 3 implies Fas + 3x — 10| <8 |x —2). 

d) Prove that —2 < x < 0 implies Le? 4235? = Gy = 8| <9.5|x +1]. 


1 


For each of the following, find all values of n € N that satisfy the given 
inequality. 
0 
< 0. 
1- 4n? 
n—1 

b) ——.— «255 

m+n? + i . 

es 
0.001 

9) n> — 2n? +4n —8 a 
a) Interpreting a rational m/n as m-n7! ¢€ R, use Postulate 1 to 


prove that 


mI 
m p mq+np m p mp m —m £ n 
= , = , and [ — =— 
n @q nq n q nq n n ( ) £ 
form,n, p,g,@€€ Zandn,qg,t £0. 
b) Using Remark 1.1, Prove that Postulate 1 holds with Q in place of R. 
c) Prove that the sum of a rational and an irrational is always irrational. 
What can you say about the product of a rational and an irrational? 

d) Let m/n, p/q € R with n, q > 0. Prove that 

mn < ifandonlyif mq <np. 

n q 


(Restricting this observation to Q gives a definition of “<” on Q.) 
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1.2.10. Prove that 
(ab + cd)” < (a? +c°)(b? +d’) 
for alla,b,c,d ER. 


1.2.11. a) Let R* represent the collection of positive real numbers. Prove that 
R¢ satisfies the following two properties. 


i) For each x € R, one and only one of the following holds: 
xeRt, -xeER™, or x=0. 


ii) Given x, y € R*, both x + y and x - y belong to R™. 


b) Suppose that R contains a subset R* (not necessarily the set of pos- 
itive numbers) which satisfies properties i) and ii). Define x ~< y by 
y —x €R®™. Prove that Postulate 2 holds with < in place of <. 


1.3 COMPLETENESS AXIOM 


In this section we introduce the last of three postulates that describe R. To 
formulate this postulate, which distinguishes Q from R, we need the following 
concepts. 


1.10 Definition. 


Let E C R be nonempty. 


i) The set E is said to be bounded above if and only if there is an M € R such 
that a < M for alla € E, in which case M is called an upper bound of E. 

ii) A number s is called a supremum of the set E if and only if s is an upper 
bound of E ands < M for all upper bounds M of E. (In this case we shall 
say that FE has a finite supremum s and write s = sup E.) 


NOTE: Because French mathematicians (e.g., Borel, Jordan, and Lebesgue) 
did fundamental work on the connection between analysis and set theory, and 
ensemble is French for set, analysts frequently use FE to represent a general set. 


By Definition 1.10ii, a supremum of a set E (when it exists) is the smallest (or 
least) upper bound of E£. By definition, then, in order to prove that s = sup E 
for some set E C R, we must show two things: s is an upper bound, AND s is 
the smallest upper bound. Here is a typical example. 


111 EXAMPLE. 
If E = [0, 1], prove that sup E = 1. 
Proof. By the definition of interval, 1 is an upper bound of £. Let M be any 


upper bound of E; that is, M > x for allx € E. Since 1 € E, it follows that 
M > 1. Thus 1 is the smallest upper bound of E. a) 
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The following two remarks answer the question: How many upper bounds 
and suprema can a given set have? 


1.12 Remark. [fa set has one upper bound, it has infinitely many upper bounds. 
Proof. If Mo is an upper bound for a set E, then sois M for any M > Mo. Wf 
113 Remark. [fa set has a supremum, then it has only one supremum. 
Proof. Let s; and s2 be suprema of the same set E. Then both s; and s> are 
upper bounds of E, whence by Definition 1.10ii, 5; < sz and sz < s,. We 


conclude by the Trichotomy Property that sj = so. a 


NOTE: This proof illustrates a general principle. When asked to prove a = b, it 
is often easier to verify that a < b and b < a separately. 


The next result, a fundamental property of suprema, shows that the supremum 
of aset FE can be approximated by a point in E (see Figure 1.3 for an illustration). 


al @ 
s1- @ 
iH @ 
oo 
Y 


points in B 


FIGURE 1.3 


1.14 Theorem. [APPROXIMATION PROPERTY FOR SUPREMA]. 
If E has a finite supremum and ¢ > 0 is any positive number, then there is a 
pointa € E such that 


sup EF —e <a <supE. 


Proof. Suppose that the theorem is false. Then there is an ¢9 > 0 such that no 
element of E lies between so := sup E — €9 and sup E. Since sup E is an upper 
bound for E, it follows that a < so for alla € E; that is, so is an upper bound 
of E. Thus, by Definition 1.1011, sup E < so = sup E — ¢9. Adding ¢9 — sup E 
to both sides of this inequality, we conclude that ¢9 < 0,acontradiction. M& 


The Approximation Property can be used to show that the supremum of any 
subset of integers is itself an integer. 
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1.15 Theorem. /f E Cc Z has a supremum, then sup E € E. In particular, if the 
supremum of a set, which contains only integers, exists, that supremum must be 
an integer. 


Proof. Suppose that s := sup E and apply the Approximation Property to 
choose an x9 € — such thats —1 <x9 <s.Ifs = xo, then s € E, as promised. 
Otherwise, s — 1 < x9 < s and we can apply the Approximation Property 
again to choose x; € E such that x9 < x1 <5. 

Subtract xo from this last inequality to obtain 0 < x1 — x9 < s — xo. Since 
—xo < 1—s, it follows that 0 < x1—x9 < s+(1—s) = 1. Thus xj—x9 € ZNO, 1), 
a contradiction by Remark 1.1iii. We conclude that s € E. a 


The existence of suprema is the last assumption about R we make. 


Postulate 3. [COMPLETENESS AXIOM]. If £ is a nonempty subset of R that 
is bounded above, then E has a finite supremum. 


We shall use Completeness Axiom many times. Our first two applications deal 
with the distribution of integers (Theorem 1.16) and rationals (Theorem 1.18) 
among real numbers. 


1.16 Theorem. [ARCHIMEDEAN PRINCIPLE]. 
Given real numbers a and b, with a > 0, there is an integer n € N such that 
b < na. 


STRATEGY: The idea behind the proof is simple. By the Completeness Axiom 
and Theorem 1.15, any nonempty subset of integers that is bounded above has 
a “largest” integer. If ko is the largest integer that satisfies kna < b, thenn = 
(ko + 1) (which is larger than ko) must satisfy na > b. In order to justify this 
application of the Completeness Axiom, we have two details to attend to: (1) Is 
the set E := {k € N: ka < b} bounded above? (2) Is E nonempty? The 
answer to the second question depends on whether b < a or not. Here are the 
details. 


Proof. Ifb < a,setn = 1. Ifa < b, consider the set FE = {k eN: ka < db}. Eis 
nonempty since 1 € E. Letk € E (i.e., ka < b). Since a > 0, it follows from 
the First Multiplicative Property that k < b/a. This proves that FE is bounded 
above by b/a. Thus, by the Completeness Axiom and Theorem 1.15, E has a 
finite supremum s that belongs to E, in particular, s € N. 

Setn =s +1. Thenn € N and (since n is larger than s), n cannot belong to 
E. Thus na > b. a 


Notice in Example 1.11 and Theorem 1.15 that the supremum of FE belonged 
to E. The following result shows that this is not always the case. 
1.17 EXAMPLE. 


Let A = {1, 5 i - ...jand B= {5 5, z,..J. Prove that sup A = sup B = 1. 
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Proof. It is clear that 1 is an upper bound of both sets. It remains to see that 
1 is the smallest upper bound of both sets. For A, this is trivial. Indeed, if 
M is any upper bound of A, then M > 1 (since 1 € A). On the other hand, 
if M is an upper bound for B, but M < 1, then 1 — M > 0. In particular, 
1/d-M) eR. 

Choose, by the Archimedean Principle, ann € Nsuchthatn > 1/(1—M). It 
follows (do the algebra) that xo := 1—1/n > M. Since xo € B, this contradicts 
the assumption that M is an upper bound of B (see Figure 1.3). | 


The next proof shows how the Archimedean Principle is used to establish 
scale. 


1.18 Theorem. [DENSITY OF RATIONALS]. 
If a,b € R satisfy a < b, then there is aq € Q such thata <q <b. 


STRATEGY: To find a fraction g = m/n such that a < q < b, we must specify 
both numerator m and denominator n. Let’s suppose first that a > 0 and that 
the set FE := {k € N: k/n < a} has a supremum, ko. Then m := ko + 1, being 
greater than the supremum of E, cannot belong to E. Thus m/n > a. Is this the 
fraction we look for? Is m/n < b? Not unless n is large enough. To see this, look 
at a concrete example: a = 2/3 and b = 1. Ifn = 1, then E has no supremum, 
When n = 2, ky = 1 and whenn = 3, ko = 2. In both cases (kg + 1)/n = 1 is 
too big. However, when n = 4, kp = 2 so (ko + 1)/4 = 3/4 is smaller than b, as 
required. 

How can we prove that for each fixed a < b there always is an n large enough 
so that if kg is chosen as above, then (kg + 1)/n < b? By the choice of ko, ko/n < 
a. Let’s look at the worst case scenario: a = ko/n. Then b > (kg + 1)/n means 


kot+l &k 1 
is Orrih 2. Us Ge at 
n n 


b 


n n 


(1.e.,b —a > 1/n). Such an n can always be chosen by the Archimedean Princi- 
ple. 

What about the assumption that sup £ exists? This requires that E be 
nonempty and bounded above. Once n is fixed, E will be bounded above by 
na. But the only way that E is nonempty is that at the very least, 1 € E (i.e., that 
1/n < a). This requires a second restriction on n. We begin our formal proof at 
this point. 


Proof. Suppose first that a > 0. Since b—a > 0, use the Archimedean 
Principle to choose ann € N that satisfies 


eer 
n> max} -, , 
a b-a 


and observe that both 1/n < aand1/n < b—a. 
Consider the set E = {k EN: k/n < a}. Since 1 € E, E is nonempty. Since 
n> 0, E is bounded above by na. Hence, by Theorem 1.15, ko := sup E exists 
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and belongs to £, in particular, to N. Set m = kg + 1 and gq = m/n. Since ko is 
the supremum of FE, m ¢ E. Thus g > a. On the other hand, since ko € E, it 
follows from the choice of n that 


k ko 


1 
b=a+(b-—a)>=> [LG a)> + er 
n noon oon 


Now suppose that a < 0. Choose, by the Archimedean Principle, an integer 
k € Nsuch thatk > —a. Then0 <k+a < k +b, and by the case already 
proved, there is anr € Qsuch thatk+a<r<k+b. Therefore, g :=r-—k 
belongs to Q and satisfies the inequality a <q <b. a 


For some applications, we also need the following concepts. 


1.19 Definition. 


Let E C R be nonempty. 


i) The set E is said to be bounded below if and only if there is an m € R such 
that a > m for all a € E, in which case m is called a lower bound of the 
set E. 

ii) A number ¢ is called an infimum of the set E if and only if t is a lower 
bound of E and t > m for all lower bounds m of E. In this case we shall say 
that E has an infimum t and write t = inf E. 

iii) E is said to be bounded if and only if it is bounded both above and below. 


When a set FE contains its supremum (respectively, its infimum) we shall fre- 
quently write max F for sup E (respectively, min E for inf E). 

(Some authors call the supremum the least upper bound and the infimum the 
greatest lower bound. We will not use this terminology because it is somewhat 
old fashioned and because it confuses some students, since the least upper bound 
of a given set is always greater than or equal to the greatest lower bound.) 

To relate suprema to infima, we define the reflection of a set E C R by 


—E:={x:x = -—aforsomeae E }. 


For example, —(1, 2] = [—2, —1). 

The following result shows that the supremum of a set is the same as the 
negative of its reflection’s infimum. This can be used to prove an Approximation 
Property and a Completeness Property for Infima (see Exercise 1.3.6). 


1.20 Theorem. [REFLECTION PRINCIPLE]. 
Let E CR be nonempty. 


i) E has a supremum if and only if —E has an infimum, in which case 


inf(—E) = —sup E. 
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ii) E has an infimum if and only if —E has a supremum, in which case 
sup(—F£) = —inf E. 


Proof. The proofs of these statements are similar. We prove only the first 
statement. 

Suppose that £ has asupremum s and set t = —s. Since s is an upper bound 
for E, s > a foralla € E,so —s < —a for alla € E. Therefore, t is a lower 
bound of —E. Suppose that m is any lower bound of —E. Then m < —a for 
alla € E,so —m is an upper bound of E. Since s is the supremum of E, it 
follows that s < —m (i.e., tf = —s > m). Thus ¢ is the infimum of —£ and 
sup E = 5 = —t = —inf(—E). 

Conversely, suppose that —E has an infimum ¢. By definition, t < —a for 
alla € E. Thus —t is an upper bound for E. Since E is nonempty, E has a 
supremum by the Completeness Axiom. a 


Theorem 1.20 allows us to obtain information about infima from results about 
suprema, and vice versa (see the proof of the next theorem). 
We shall use the following result many times. 


1.21 Theorem. [MONOTONE PROPERTY]. 
Suppose that A C B are nonempty subsets of R. 


i) If B has a supremum, then sup A < sup B. 
ii) If B has an infimum, then inf A > inf B. 


Proof. i) Since A © B, any upper bound of B is an upper bound of A. There- 
fore, sup B is an upper bound of A. It follows from the Completeness Axiom 
that sup A exists, and from Definition 1.10ii that sup A < sup B. 

ii) Clearly, —A C —B. Thus by part i), Theorem 1.20, and the Second 
Multiplicative Property, 


inf A = — sup(—A) > —sup(—B) = inf B. | 


It is convenient to extend the definition of suprema and infima to all subsets 
of R. To do this we expand the definition of R as follows. The set of extended real 
numbers is defined to be R := R | {too}. Thus x is an extended real number if 


and only if either x € R, x = +00, or x = —o. 
Let E C R be nonempty. We shall define sup E = +00 if E is unbounded 
above and inf E = —oo if E is unbounded below. Finally, we define sup 4 = —oo 


and inf@ = +o0. Notice, then, that the supremum of a subset E of R (respec- 
tively, the infimum of £) is finite if and only if EF is nonempty and bounded 
above (respectively, nonempty and bounded below). Moreover, under the con- 
vention —oco < a anda < ov for alla € R, the Monotone Property still holds 
for this extended definition; that is, if A and B are subsets of R and A C B, then 
sup A < sup B and inf A > inf B, provided we use the convention that —oo < oo. 
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EXERCISES 


1.3.0. Decide which of the following statements are true and which are false. 


1.3.1. 


1.3.2. 


1.3.4. 


1.3.5. 


1.3.6 


1.3.7. 


1.3.8. 


1.3.3). 


Prove the true ones and give counterexamples to the false ones. 


a) If A and B are nonempty, bounded subsets of R, then sup(A N B) < 
sup A. 

b) Let e be a positive real number. If A is a nonempty, bounded subset 
of R and B = {ex : x € A}, then sup(B) = é sup(A). 

c) IfA+B:= {a+b: ae Aandb € B}, where A and B are nonempty, 
bounded subsets of R, then sup(A + B) = sup(A) + sup(B). 

d) IfA—B:= {a—b: ae Aandb € B}, where A and B are nonempty, 
bounded subsets of R, then sup(A — B) = sup(A) — sup(B) 


Find the infimum and supremum of each of the following sets. 


a) E={x ER :x*+3x = 10} 

b) E={x ER :x*4+3x—10> x7 andx < 10} 
c) E={p/q €Q : 2p? —4q* > Oand0 < p < 2q} 
d) E={x ER :x =(-1)"/nforn EN} 

e) E={x ER :x=1/n+(-1)"/nforn EN} 

f) E={neN :5—(-4)"/277} 


Prove that for each a € R and eachn € N there exists a rational r, such 
that ja — ry| < 1/n. 

[DENSITY OF IRRATIONALS] This exercise is used in Section 3.3. Prove 
that if a < b are real numbers, then there is an irrational € € R such that 
a<éc<b. 

Prove that a lower bound of a set need not be unique but the infimum 
of a given set E is unique. 

Show that if E is a nonempty bounded subset of Z, then inf E exists and 
belongs to E. 


. This exercise is used in many sections, including 2.2 and 5.1. Use the 


Reflection Principle and analogous results about suprema to prove the 
following results. 


a) [APPROXIMATION PROPERTY FOR INFIMA] Prove that if a set E C R has 
a finite infimum and ¢ > 0 is any positive number, then there is a 
point a € E such that inf E +e >a > inf E. 

b) [COMPLETENESS Property FoR InFIMA] If E C R is nonempty and 
bounded below, then E has a (finite) infimum. 


a) Prove that if x is an upper bound of a set E C Rand x € E£, then x is 
the supremum of E. 

b) Make and prove an analogous statement for the infimum of E. 

c) Show by example that the converse of each of these statements is 
false. 


Suppose that E, A,B C Rand E = AUB. Prove that if E has a supre- 
mum and both A and B are nonempty, then sup A and sup B both exist, 
and sup E is one of the numbers sup A or sup B. 
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1.3.9. A dyadic rational is a number of the form k/2” for some k,n € Z. Prove 
that if a and b are real numbers and a < b, then there exists a dyadic 
rational g such that a <q <b. 

1.3.10. Let x, ¢ R and suppose that there is an M € R such that |x,| < M 
forn € N. Prove that s, = sup{xn, Xn+41,...} defines a real number for 
each n € N and that s; > so > ---. Prove an analogous result about 
ty = inf{xn, Xn41,..-}- 

1.3.11. If a,b € Rand b—a > 1, then there is at least one k € Z such that 
a<k<b. 


1.4 MATHEMATICAL INDUCTION 


In this section we introduce the method of Mathematical Induction and use it 
to prove the Binomial Formula, a result that shows how to expand powers of a 
binomial expression (i.e., an expression of the form a + b). 

We begin by obtaining another consequence of the Completeness Axiom, 
the Well-Ordering Principle, which is a statement about the existence of least 
elements of subsets of N. 


1.22 Theorem. [WELL-ORDERING PRINCIPLE]. 
If E is anonempty subset of N, then E has a least element (i.e., E has a finite 
infimum and inf E € E). 


Proof. Suppose that E C N is nonempty. Then —£ is bounded above, by 
—1, so by the Completeness Axiom sup(—£) exists, and by Theorem 1.15, 
sup(—E) € —E. Hence by Theorem 1.20, infE = —sup(—E£) exists, and 
infE ¢ —(—E)=E. | 


Our first application of the Well-Ordering Principle is called the Principle of 
Mathematical Induction or the Axiom of Induction (which, under mild assump- 
tions, is equivalent to the Well-Ordering Principle—see Appendix A). 


1.23 Theorem. Suppose for eachn € N that A(n) is a proposition (i.e., a verbal 
statement or formula) which satisfies the following two properties: 
i) A(1) is true. 
ii) For every n € N for which A(n) is true, A(n + 1) is also true. 
Then A(n) is true for alln EN. 


Proof. Suppose that the theorem is false. Then the set E = {n € N: A(n) 
is false} is nonempty. Hence by the Well-Ordering Principle, E has a least 
element, say x. 

Since x € E CN C Z@, we have by Remark 1.1ii that x > 1. Since x € E, 
we have by hypothesis i) that x # 1. In particular, x — 1 > 0. Hence, by 
Remark 1.1li and iii, x —1>landx—1eN. 

Since x — 1 < x and x is a least element of E, the statement A(x — 1) must 
be true. Applying hypothesis ii) ton = x — 1, we see that A(x) = A(n + 1) 
must also be true; that is, x ¢ E, a contradiction. | 
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Recall that if x9, x1, ..., x» are real numbers and 0 < j <n, then 


n 
So x = Xj +X j+1 tes: +Xp 
k=j 
denotes the sum of the x;’s as k ranges from j to n. The following examples 


illustrate the fact that the Principle of Mathematical Induction can be used to 
prove a variety of statements involving integers. 


1.24 EXAMPLE. 
Prove that 


n 
Yk —1)3k +2) =3n?+6n? +7 
k=1 


forn EN. 
Proof. Let A(n) represent the statement 
n 
YS \Gk = 1) Gk +2) = 3n? + 6n? +n. 
k=1 


For n = | the left side of this equation is 2 - 5 and the right side is 3 + 6+ 1. 
Therefore, A(1) is true. Suppose that A(7) is true for some n > 1. Then 


n+l n 
Yk — 1)(3k +2) = Bn +2)3n +5) + Yk — 1)(3k +2) 
k=1 k=1 


= (3n +2)(3n +5) + 3n?+ 6n7?+n 
= 3n? + 15n7+ 22n +10. 


On the other hand, a direct calculation reveals that 
3(n +1)? +6(n +1)? + (n +1) = 3n? + 15n* + 22n + 10. 


Therefore, A(m + 1) is true when A() is. We conclude by induction that A(n) 
holds for alln EN. a 


Two formulas encountered early in an algebra course are the perfect square 
and cube formulas: 


(a+b) =a? +2ab+b* and (a+b) =a? 4+ 3a°b 4+ 3ab? 4+ b°. 


Our next application of the Principle of Mathematical Induction generalizes 
these formulas from n = 2 and 3 to arbitrary n € N. 
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Recall that Pascal’s triangle is the triangular array of integers whose rows 
begin and end with 1s with the property that an interior entry on any row is 
obtained by adding the two numbers in the preceding row immediately above 
that entry. Thus the first few rows of Pascal’s triangle are as below. 


| Oe eee ae 
1464 1 
1 5 10 10 5 1 
1 6 15 20 15 6 1 


Notice that the third and fourth rows are precisely the coefficients that appeared 
in the perfect square and cube formulas above. 

We can write down a formula for each entry in each row of the Pascal triangle. 
The first (and only) entry in the first row is 


()=" 


Using the notation 0! := 1 anda! := 1-2---(7—1)-n forn € N, define the 
binomial coefficient n choose k by 


@ _ n! 
k})”) (n—k Ik! 


forO0<k<nandn=0O,l,.... 


Since i.) a ") = | for alln EN, the following result shows that the 


binomial coefficient n over k does produce the (k + 1)st entry in the (n + 1)st 
row of Pascal’s triangle. 


1.25 Lemma. 
Ifn,k € Nand 1 <k <n, then 


Proof. By definition, 


( n ) () - nik ni(n—k +1) 
x) OS Gaba DE GED 


- mar) aC) = 
~~ (n-k+Dik! Vk J 
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Binomial coefficients can be used to expand the nth power of a sum of 


two terms. 


1.26 Theorem. [BINOMIAL FORMULA]. 
Ifa,b€ R,n EN, and 0° is interpreted to be 1, then 


(a+b) = Ss jae 


k=0 


Proof. The proof is by induction on n. The formula is obvious for n = 1. 
Suppose that the formula is true for some n € N. Then by the inductive 
hypothesis and Postulate 1, 


(a+b)"*! =(@+b)a+by 


= (a+b) (> ee ) 
k=0 
= (32(,)er is (>2(p)ers#") 
eo ao 
= Ga +o(pet) fe (w +5(peroe) 
k=l k=0 
— ttl +) (i) +( 7 )) gtk+] pk 4 prt, 
= k k-1 


Hence it follows from Lemma 1.25 that 


n n+l 
oe es EWN eee 4 aid HP dN ets 
(a+b) =a + ) k a b\ +b = ) k a b*; 
k=1 k=0 


that is, the formula is true for n+1. We conclude by induction that the formula 
holds for alln €N. a 


We close this section with two optional, well-known results that further 


demonstrate the power of the Completeness Axiom and its consequences. 


*1.27 Remark. /f x > 1 and x ¢.N, then there is ann € N such thatn < x < 
n+l. 


Proof. By the Archimedean Principle, the set E = {m € N: x < m}is 
nonempty. Hence by the Well-Ordering Principle, E has a least element, 
say mo. 

Set n = mo — 1. Since mp € E, n+1 = mo > x. Since mo is least, 
n=mo—1 <x. Sincex ¢ N, we also haven 4 x. Therefore,n <x <n+1. 
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Using this last result, we can prove that the set of irrationals is nonempty. 


*1.28 Remark. Jf n € N is not a perfect square (i.e., if there is no m € N such 
that n = m7), then </n is irrational. 


Proof. Suppose to the contrary that n € Nis not a perfect square but ./n € Q; 
that is, /n = p/q for some p,g € N. Choose by Remark 1.27 an integer 
mo € N such that 


my < /n <mo +1. (10) 


Consider the set E := {k € N: k./n € Z}. Since q./n = p, we know that E 
is nonempty. Thus by the Well-Ordering Principle, FE has a least element, say 
no. 
Set x = no(./n—mo). By (10), 0 < ./n—mo < 1. Multiplying this inequality 
by no, we find that 


O<x <n. (11) 


Since no is a least element of E, it follows from (11) that x ¢ E. On the 
other hand, 


x/n = no(/n — mo)/n = non — monoV/n € Z 


since ng € E. Moreover, since x > 0 and x = ng./n — nomo is the difference 
of two integers, x ¢ N. Thus x € E, a contradiction. a 


EXERCISES 


1.4.0. Decide which of the following statements are true and which are false. 
Prove the true ones and give counterexamples to the false ones. 


a) Ifa >Oandb £0, then (a +b)" > b" foralln EN. 

b) Ifa <0 <b, then (a +b)" < b" foralln EN. 

c) If n € N is even and both a and b are negative, then (a + b)” > 
a” + na™'b. 

d) Ifa £0, then 


1 " (n\ (a — 2)"-* 
Qn = S k qugn—-k 
for alln EN. 


1.4.1. a) Prove that if x) > 2 and x,4; = 14+ /x, —1 forn € N, then 2 < 

Xn+1 < Xn holds for alln €N. 

b) Prove that if 2 < x; < 3 and x4); =2+/x, —2 forn EN, then 
0 < Xn < Xn41 holds for alln EN. 

c) Prove that if 0 < x; < land x,4; = 1—/1—-x, forn € N, then 
0 < Xn+41 < Xn holds for alln €N. 

d) Prove that if 3 < xj <5 and x,4; =2+JSx, — 2, then 3 < x41 < Xn 
holds for alln EN. 
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1.4.2. Use the Binomial Formula or the Principle of Induction to prove each 


1.4.3. 


1.4.4. 


1.4.5). 


of the following. 


a) tpi)" (;) — 0 foralln €N. 


b) (a+b)" =a" +b" for alln € Nanda,b > 0. 
c) d+ 1/n)" > 2 forallneN. 


d) yet ) = try 2* for alln EN. 


Prove each of the following statements. 


a) 2n+1 < 2” forn =3,4,.... 
b) n < 2” forn =1,2,.... 
c) n?<2"41forn =1,2,.... 
d) n? <3" forn =1,2,.... 


Parts a) and c) of this exercise are used in Sections 2.4 and 5.1. 
Prove that the following formulas hold for alln € N. 
us 1 
poy ea 
k=l 2 
b) oye = MADEN +1) 
k=1 6 
na-—l 1 
c) a k =l1- ae a # 0) 
k=1 @ a 
n n(4n? — 1) 


d) }Qk-1)7? = 
k=l 3 


This exercise is used in Section 2.3. Prove that 0 < a < b implies 0 < 
a" <b" and0 < w/a < x/b for alln € N. 


1.4.6. Prove that 2” + 3” is a multiple of 5 for all oddn EN. 


1.4.7. 


Prove that 2” < n!+2fornéeN. 


1.4.8. Prove that 


1.4.9, 


n(n — 1)(n — 2) 
> — i ee a 
6 


gn 


forn EN. 


a) Using Remark 1.28, prove that the square root of an integer m is 
rational if and only if m = k* for some k € N. 

b) Prove that /n +3 + ./n is rational for some n € N if and only if 
n=1. 


c) Find alln € N such that /n +7 4+ ./n is rational. 


1.4.10. Let ag = 3, bo = 4, and co = 5. 


a) Let ay = ag-1 +2, dy = 2ag_1 + by_1 + 2, and cy = 2ag_1 + ce_1 +2 
for k EN. Prove that cy, — by is constant for allk EN. 
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b) Prove that the numbers defined in part a) satisfy 


a ee, 
ay + de = cK 


for allk EN. 


1.5 INVERSE FUNCTIONS AND IMAGES 


Let f : X > Y (ie., suppose that f is a function from one set X to another 
set Y). In this section, we obtain simple conditions for when f has an inverse, 
introduce images and inverse images induced by /, and explore how they inter- 
act with the algebra of sets. 

First, recall from Section 1.1 that a function f : X¥ — Y has an inverse function 
if and only if Ran(f) = Y and each y ¢€ Y has a unique preimage x € X, in 
which case we define the inverse function f—' by f~'(y) := x. In particular, if 
f : X — Y has an inverse function, then 


f\F@)=x and f(f'o) =y (12) 
forallx ¢ X andyeY. 


We introduce the following concepts in order to answer the question, “Is there 
an easy way to recognize when /f has an inverse?” 


1.29 Definition. 
Let X and Y be sets and f: X > Y. 


i) f is said to be 1-1 (one-to-one or an injection) if and only if 
x1,%2 EX and Ff (x1) = Ff (x2) imply xX, =X2. 


ii) f is said to be onto (or a surjection) if and only if for each y € Y there is 
an x € X such that y = f(x). 
iii) f is called a bijection if and only if it is both 1-1 and onto. 
Sometimes, to emphasize the domain and range of f, we shall say that a 
bijection f : X > Y is 1-1 from X onto Y. 


For example, the function f(x) = x* is 1-1 from [0, 00) onto [0, oo) but not 
1-1 on any open interval containing 0. 

We shall now prove that bijections always have inverse functions and that (12) 
characterizes those inverses. 


1.30 Theorem. Let X and Y be sets and f : X — Y. Then the following three 
statements are equivalent. 


i) f has an inverse; 
ii) fis 1-1 from X onto Y; 
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iii) There is a function g : Y — X such that 
g(f(x)) =x forallx ex (13) 


and 


f(g(y)) =y forally <Y. (14) 


Moreover, for each f : X — Y, there is only one function g that satisfies (13) and 
(14). It is the inverse function f—'. 


Proof. i) implies ii). By definition, if f has an inverse, then Ran(f) = Y 
(so f takes X onto Y) and each y € Y has a unique preimage in X [so, if 
f(y) = f(Q2), then y; = yo, ie., f is 1-1 on X]. 

ii) implies iii). The proof that i) implies ii) also shows that if f : X > Y 
is 1-1 and onto, then f has an inverse. In particular, g(y) := f —!(y) satisfies 
(13) and (14) by (12). 

iii) implies i). Suppose that there is a function g : Y + X which satisfies 
(13) and (14). If some y € Y has two preimages, say x1 4 x2 in X, then 
f(@1) = y = f (x2). It follows from (13) that x1 = g(f(«1)) = g(f @2)) = x2, 
a contradiction. On the other hand, given y € Y, set x = g(y). Then f(x) = 
f(g(y)) = y by (14), so Ran(f) = Y. 

Finally, suppose that h is another function which satisfies (13) and (14), and 
fix y € Y. Byii), there is an x € X such that f(x) = y. Hence by (13), 


h(y) =h(f@) =x = 8(f@)) = 80); 
that is, 4 = g on Y. It follows that the function g is unique. | 


There are two ways to show that a given function f is 1-1 on a set X. We can 
suppose that f(x) = f (x2) for some x1, x2 € X, and prove (using algebra, for 
example) that x; = x2. If X is an interval in R and f is differentiable, there is an 
easier way to prove that f is 1-1 on X. 


1.31 Remark. Let I be an interval and let f : I — R. [If the derivative of f is 
either always positive on I, or always negative on I, then f is 1-1 on I. 


Proof. By symmetry, we may suppose that the derivative f’ of f satisfies 
f'(x) > 0 for all x € J. We will use a result that almost everyone who has 
studied one variable calculus remembers (for a proof, see Theorem 4.17): If 
f’ > 0onan interval /, then f is strictly increasing on /; that is, x1, x2 € 7 and 
xX, < x2 imply that f(x) < f(x). 

To see why this implies that f is 1-1, suppose that f(x1) = f(x2) for some 
x1,x2in X. If x; # x2, then it follows from the trichotomy property that either 
xX, < x2 OF x2 < x,. Since f is strictly increasing on /, either f(x,) < f(x2) 
or f (x2) < f(x). Both of these conclusions contradict the assumption that 


f (x1) = f (2). a 
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By Theorem 1.30, f : X — Y has an inverse function f~! if and only if 
f-'(f(@)) = x for all x € X and f(f~!(y)) = y for all y € Y. This suggests that 
we can find a formula for f~! if y = f(x) can be solved for x. 


*1.32 EXAMPLE. 
Prove that f(x) = e* — e~ is 1-1 on R and find a formula for f~! on Ran(f). 


Solution. Since f’(x) = e* +e > 0 for all x e€ R, f is 1-1 on R by 
Remark 1.31. 

Let y = e* —e™*. Multiplying this equation by e* and collecting all nonzero 
terms on one side of the equation, we have 


e* — yey _1 =0, 


a quadratic in e*. By the quadratic formula, 


x ytvy? +4 
ee (15) 


Since e* is always positive, the minus sign must be discarded. Taking the loga- 
rithm of this last identity, we obtain x = log(y + Vy? + 4) — log 2. Therefore, 


f (x) = log(x + Vx? + 4) — log2. a 


The following concepts greatly simplify the general theory of continuity (see 
Theorem 9.51, for example). 


1.33 Definition. 


Let X and Y be sets and f : X — Y. The image of aset E C X under f is the 
set 


f(A) :={y € Y: y= f(x) for some x € E}. 


The inverse image of aset E C Y under f is the set 


f \(E) = {x € X: f(x) =y forsome y € £}. (16) 


When E is an interval, we will sometimes drop the extra parentheses; for 
example, write f(a, b] for f((a, b]) and f—!(a, b] for f~!((a, d)). 
1.34 EXAMPLE. 


Find the images and inverse images of the sets J = (—1, 0) and J = (0, 1] under 
the function f(x) = x* +x. 


Solution. Since “find” doesn’t mean “prove,” we look at the graph y = x? +x. 
By definition, f(/) consists of the y-values of f(x) as x ranges over J = (—1, 0). 
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Since f has roots at x = 0, —1 and has a minimum of —0.25 at x = —0.5, it is 
clear by looking at the graph that f(/) = [—0.25, 0). Since f~!(/) consist of the 
x-values whose images belong to J] = (—1, 0), and the graph of f lies below the 
x-axis only when —1 < x < 0, it is also clear that f'(W = (-1,0). Similarly, 
f(J) = (0, 2] and 


Ms -1-J5 -14+ V5 
i (= 54 JU (0 154). 


(Be sure to look at the graph of y = x? +x and understand how these numbers 
were obtained.) a 


WARNING. Unfortunately, there are now three meanings to f~!:(1) f~!(x) = 
1/f (x), the reciprocal of f which exists when f is real-valued and f(x) 4 0; 
(2) f~'(x), the inverse function of f which exists when f is 1-1 and onto; (3) 
f (EB), the inverse image of E under f, which always exists. Context will usu- 
ally indicate which meaning we are using. 


Notice that Definition 1.33 contains an asymmetry: y € f(£) means that 
y = f(x) for some x € E, but x € f~!(E) does NOT mean that x = f~!(y) for 
some y € E. For example, let f(x) = sinx. Since sin(kz) = 0 for all k € Z, the 
inverse image of {0} under f is f~!({0}) = {ka : k € Z}, but since the range of 
aresin x is [—2/2, 2/2], the image of {0} under f~! is arcsin{0} = {0}. 

Before we give an account of how images and inverse images interact with set 
algebra (specifically, what the image and inverse image of a union, an intersec- 
tion, and a complement of sets are), we need to expand the algebra of sets to 
include unions and intersections of infinitely many sets. We need these concepts 
for some of the deeper results in the second half of this book because many of 
the proofs involve associating a set Ey with each aw in a set A. With this end in 
mind, we introduce the following terminology. 

A collection of sets € is said to be indexed by a set A if and only if there is a 
function F from A onto € (i.e., each a € A is associated with one and only one 
set in €). In this case we shall call A the index set of E, say that E is indexed by A, 
and represent F(a) by Ey. In particular, € is indexed by A means € = {Ey}wea.- 


1.35 Definition. 
Let € = {Ey}uca be a collection of sets. 


i) The union of the collection € is the set 


|) Ea = {x:xe€E, forsomeaeé A}. 


acA 
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ii) The intersection of the collection € is the set 


() Ew :={x:xeE, foralla <€ A}. 


acA 


For example, 


U [0,x) =[0,1) and () [0, x) = {O}. 


xe(0,1] xe(0,1] 


The following important, often used result shows that there is an easy way to 
get from unions to intersections, and vice versa. 


1.36 Theorem. [DEMORGAN’S LAWS]. 
Let X be a set and {Eq}uca be a collection of subsets of X. If for each E © X 
the symbol E* represents the set X\ E, then 


(U a) =( \E (17) 


aecA acA 


and 


(n ta) = ier (18) 


acA acA 


Proof. Suppose that x belongs to the left side of (17); that is, x € X and 
x € Uses Eu. By definition, x <¢ X and x ¢ Ey, for alla € A. Hence, x € E¢ 
for alla ¢€ A; that is, x belongs to the right side of (17). These steps are 
reversible. This verifies (17). A similar argument verifies (18). a 


The following result, which plays a prominent role in Chapters 9 and 12, 
describes images and inverse images of unions and intersections of sets. 


1.37 Theorem. Let X and Y be sets and f : X — Y. 
i) If {Ea}wea is a collection of subsets of X, then 


i (U ta) =|Jf(Ee) and f (n ta) < ( ) f(a). 


acA acA aeA acA 


ii) If B and C are subsets of X, then f(C\B) > f(C)\f(B). 
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iii) If {Eu}aea is a collection of subsets of Y, then 


r'(Uee)=Usrten ant (M6) = 2 eee. 


acA acA aeA acA 


iv) If B and C are subsets of Y, then f~'(C\B) = f~'(C)\f7!(B). 
v) IfE © f(X), then f(f—|(E)) = E, but if E C X, then f-'(f(E)) 2 E. 


Proof. i) By definition, y € f(Uwea Eq) if and only if y = f(x) for some x € 
Eq anda € A. This is equivalent to y € Ugea f (Eq). Similarly, y € f(Naea Ex) 
if and only if y = f(x) for some x € Nye4 Ey. This implies that for alla € A 
there is an xy € Ey such that y = f(x,). Therefore, y € Ngca f (Eg). 

ii) If y € f(C)\f(B), then y = f(c) for some c € C but y ¥ f(b) for any 


b € B. It follows that y € f(C\B). Similar arguments prove parts iii), iv), 
and v). 


It is important to recognize that the set inequalities in parts i), ii), and v) 
can be strict unless f is 1-1 (see Exercises 1.5.6 and 1.5.7). For example, if 
f(x) =x’, E; = {1}, and E> = {—1}, then f(E, N Er) = Vis a proper subset of 
f(E1) 0 f(E2) = {1}. 


EXERCISES 


1.5.0. Decide which of the following statements are true and which are false. 
Prove the true ones and give counterexamples to the false ones. 


a) Let f(x) = sinx. Then the function 


nx 30 
f: E =| > [-1, 1] 


is a bijection, and its inverse function is arcsin x. 

b) Suppose that A, B, and C are subsets of some set X and that f : 
X—> xX. THANB #4, then f(A)N f(BUC) #9. 

c) Suppose that A and B are subsets of some set X. Then (A\B)* = 
B\A. 

d) If f takes [—1, 1] onto [—1, 1], then f—!(f (0})) = {0}. 


1.5.1. a) For each of the following, prove that f is 1-1 on E and find f(£). 


a) f(x) =2x+4,FE=R 

B) f(x) =e", E =, 00) 

y) f &) = 2tanx, E = (7/2, 37/2) 

6) f (x) =x* —4x 41, E = (—o, —3] 

e) f(x) =e +2|-[Ix+ 1] +2,E=R 

ot) f@)=@4+)/(x? +1), E =[-1,0] 
*b) Find an explicit formula for f~! on f(E). 
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1.5.2. Find f(£) and f~!(E) for each of the following. 
a) f (x) =5—- 10x, E = (-2, 1) 
b) f Hat = 208 = 1,2) 
c) f (~) =4x — x’, E=[0,4) 
d) f (x) = log (x? —4x +5), E = (0,5] 
e) f (x) =sinx, E = [0, co) 
1.5.3. Give a simple description of each of the following sets. 
A) Jt Tai sed] 
xe[—1,1] 


b) ff) [k-2,x+2] 
xe[—2,2] 


Gy Ly [0, t) 
keN 
d) Abe + 
lara) 
keN 


1.5.4. Prove (18). 
1.5.5. Prove Theorem 1.37iii, iv, and v. 
1.5.6. Let f(x) =x’. 


a) Find subsets B and C of R such that f(C\B) 4 f(C)\f(B). 
b) Find a subset E of R such that f~!(f(E)) & E. 


1.5.7|. This exercise is used several times in Chapter 12. Let X, Y be sets and 
f :X — Y. Prove that the following are equivalent. 


a) fis1-lonX. 

b) f(A\B) = f(A)\f(B) for all subsets A and B of X. 

c) f-'(f(B)) = E for all subsets E of X. 

d) f(ANB) = f(A) N f(B) for all subsets A and B of X. 


1.6 COUNTABLE AND UNCOUNTABLE SETS 


In this section we will show how to use bijections to “count” infinite sets. We 
begin by examining what it means to count a finite set. When we count a 
finite set E, we assign consecutive numbers in N to the elements of £; that 
is, we construct a function f from {1,2,...,n} to E, where n is the num- 
ber of elements in E. For example, if E has three objects, then the “count- 
ing” function, f, takes {1, 2, 3} to E. Now in order to count E properly, 
we must be careful to avoid two pitfalls. We must not count any element of 
E more than once (i.e., f must be 1-1), and we cannot miss any element of 
E (ie., f must take {1, 2, 3} onto EZ). Accordingly, we make the following 
definition. 
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1.38 Definition. 


Let E be a set. 


i) E issaid to be finite if and only if either E = 9 or there exists a 1-1 function 
which takes {1,2,...,n} onto E, for somen € N. 
ii) E is said to be countable if and only if there exists a 1-1 function which 
takes N onto E. 
iii) E is said to be at most countable if and only if E is either finite or countable. 
iv) E is said to be uncountable if and only if E is neither finite nor countable. 


Loosely speaking, a set is countable if it has the same number of elements as 
N, finite if it has less, and uncountable if it has more. 

To show that a set E is countable, it suffices to exhibit a 1-1 function f from 
N onto E. For example, the set of even integers E = {2,4,...} is countable 
because f(k) := 2k is 1-1 from N onto £. Thus, two infinite sets can have the 
same number of elements even though one is a proper subset of the other. (In 
fact, this property can be used as a definition of “infinite set.”) 

The following result shows that not every infinite set is countable. 


1.39 Remark. [CANTOR’S DIAGONALIZATION ARGUMENT]. The open 
interval (0, 1) is uncountable. 


STRATEGY: Suppose to the contrary that (0, 1) is countable. Then by def- 
inition, there is a function f on N such that f(1), f(2),... exhausts the ele- 
ments of (0, 1). We could reach a contradiction if we could find a new number 
x € (0, 1) that is different from all the f(k)’s. How can we determine whether 
two numbers are different? One easy way is to look at their decimal expansions. 
For example, 0.1234 4 0.1254 because they have different decimal expansions. 
Thus, we could find an x that has no preimage under f by making the deci- 
mal expansion of x different by at least one digit from the decimal expansion of 
EVERY f (&). 

There is a flaw in this approach that we must fix. Decimal expansions are 
unique except for finite decimals, which always have an alternative expansion 
that terminates in 9s (e.g., 0.5 = 0.4999... and 0.24 = 0.23999...) (see Exer- 
cise 2.2.10). Hence, when specifying the decimal expansion of x, we must avoid 
decimals that terminate in 9s. 


Proof. Suppose that there is a 1-1 function f that takes N onto the interval 
(0, 1). Write the numbers f(j), j € N, in decimal notation, using the finite 
expansion when possible, that is, 


FQ) = Oayj012..., 
f (2) = 0.a21022..., 
FB) = Oe3ie3? 2; 


ery 
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where a;; represents the jth digit in the decimal expansion of f(i) and none 
of these expansions terminates in 9s. Let x be the number whose decimal 


expansion is given by 0.6; 82..., where 
pee ake + 1 if ag, <5 
aon ak — 1 if az, > 5. 


Clearly, x is a number in (0, 1) whose decimal expansion does not contain 
one 9, much less terminate in 9s. Since f is onto, there is a j € N such that 
fVU) = x. Since we have avoided 9s, the decimal expansions of f(j) and 
x must be identical (e.g., aj; = Bj := aj; +1). It follows thatO = +l,a 
contradiction. | 


It is natural to ask about the countability of the sets Z, Q, and R. To answer 
these questions, we prove several preliminary results. First, to show that a set 
E is at most countable, we do not need to construct a ONE-TO-ONE function 
which takes N onto E. 


1.40 Lemma. 
A nonempty set E is at most countable if and only if there is a function g from 
N onto E. 


Proof. If E is countable, then by Definition 1.38ii there is a (1-1) function f 
from N onto E,so g := f takes N onto E. If E is finite, then there is ann ¢ N 
and a 1-1 function f that takes {1,2,...,2} onto E. Hence 


oki wee 
FU) j>n 
takes N onto E. 

Conversely, suppose that g takes N onto E. We need to construct a function 
f that is 1-1 from some subset of N onto E. We will do this by eliminating the 
duplication in g. To this end, let k; = 1. Ifthe set FE) := {k EN: gtk) # g(k1)} 
is empty, then EF = {g(k,)}, thus evidently at most countable. Otherwise, let 
ky be the least element in £; and notice that ky > ky. 

Set Ey := {k EN: gtk) © E\{g(k1), g(ko)}}. If Eo is empty, then EF = 
{g(k1), g(k2)} is finite, hence at most countable. Otherwise, let k3 be the least 
element in Ez. Since g(k3) € E\{g(k1), g(k2)}, we have g(k3) 4 g(k2) and 
g(k3) # g(k1). Since g is a function, the first condition implies k3 4 k2. Since 
ky is least in E;, the second condition implies ky < k3. Hence, ky < kz < k3. 

Continue this process. If it ever terminates, then some 


Ej = {k EN: g(k) € E \ {gki),..-, skp} 


is empty, so E is finite, hence at most countable. If this process never ter- 
minates, then we generate integers kj < kz < --- such that kj+, is the least 
element of E; for j = 1,2,.... 
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Set f(j) = g(kj), j € N. To show that f is 1-1, notice that j ¢ ¢ implies 
that k; A ke, say kj < ke. Then k; < ke_1, so by construction 


g(ke) € E \ {g(k1), ..., 8(Kj), «++, 8(ke-1)} S E \ (8(k1), .--, ski}. 


In particular, g(ke) A g(k;); that is, f(€) A f(J). 

To show that f is onto, let x € E. Since g is onto, choose £ € N such that 
g(t) = x. Since by construction j < k;, use the Archimedean Principle to 
choose a j € N such that k; > €. Since k; is the least element in E;_1, it 
follows that g(¢) cannot belong to E\{g(ki), ..., g(kj—1)}; that is, g() = g(kn) 
for some n € [1, j — 1]. In particular, f(n) = g(k,) = x. a 


Next, we show how set containment affects countability and use it to answer 
the question about countability of R. 


1.41 Theorem. Suppose that A and B are sets. 


i) If A C B and B is at most countable, then A is at most countable. 
ii) If A C B and A is uncountable, then B is uncountable. 
iii) R is uncountable. 


Proof. i) Since B is at most countable, choose by Lemma 1.40 a function g 
which takes N onto B. We may suppose that A is nonempty, hence fix an 
ao € A. Then 


(n (nyEeA 
fay 2 ) 8 
ag snygAa 
takes N onto A. Hence by Lemma 1.40, A is at most countable. 
ii) If B were at most countable, then by part i), A would also be at most 
countable, a contradiction. 


iti) By Remark 1.39, the interval (0, 1) is an uncountable subset of R. Thus, 
by part ii), R is uncountable. | 


The following result shows that the Cartesian product of two countable sets is 
countable, and that a countable union of countable sets is countable. 


1.42 Theorem. Let A, A2,... be at most countable sets. 
i) Then A, x Az is at most countable. 


ii) If 
[o.@) 
E=|JA4; — LJ Aj = {x:x EA; forsome j €N}, 
j=l JEN 


then E is at most countable. 


Proof. i) By Lemma 1.40, there exist functions ¢ (respectively, y) which take 
N onto A, (respectively, onto Az). Hence f(n, m) := (¢(n), W(m)) takes NxN 
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onto A, x A2. If we can construct a function g which takes N onto N x N, then 
by Exercise 1.6.5a, fog takes N onto A; x Az. Hence by Lemma 1.40, A; x Az 
is at most countable. 

To construct the function g, plot the points of Nx N in the plane. Notice that 
we can connect these lattice points with a series of parallel backward-slanted 
lines; for example, the first line passes through (1, 1), the second line passes 
through (1, 2) and (2, 1), and the third line passes through (1, 3), (2, 2), and 
(3, 1). This suggests a method for constructing g. Set g(1) = d, 1), g(2) = 
(1,2), g3)=@2,), ¢4) =G,D,.... 

If you wish to see an explicit formula for g, observe that the nth line passes 
through the set of lattice points 


(1,n), (2,n — 1), 3,n —2),...,(a7 —1, 2), (n, 1); 


that is, through the set of lattice points (k, j) which satisfy k + j =n+1. 
Since the sum of integers 1 + 2+ --- + (n — 1) is given by (n — 1)n/2 (see 
Exercise 1.4.4a), there are (n — 1)n/2 elements in the first n — 1 slanted lines. 
Hence a function which takes N onto the nth slanted line is given by 


gs) =(.n+1—-£), (19) 


where j = €+ (n — 1)n/2. This function is defined on all of N because given 
j € N, wecan use the Archimedean Principle and the Well-Ordering Principle 
to choose n least such that j < n(n + 1)/2; that is, such that 7 = €+ (n — 1)n/2 
for some ¢ € [1,”]. Thus g takes N onto N x N. 

ii) By Lemma 1.40, choose functions f; that take N onto A;, j € N. Clearly, 
the function h(k, j) := f;,(j) takes N x N onto E. Hence the function h o g, 
where g is defined by (19), takes N onto E. We conclude by Lemma 1.40 that 
E is at most countable. | 


1.43 Remark. The sets Z and Q are countable, but the set of irrationals is 
uncountable. 


Proof. Z = NU (-N) U {0} and Q = UP. {p/n : p € Z} are both countable 
by Theorem 1.472ii. 

If R\Q were countable, then R = (R\Q) U Q would also be countable, a 
contradiction of Theorem 1.41iii. | 


EXERCISES 


1.6.0. Decide which of the following statements are true and which are false. 
Prove the true ones and give counterexamples to the false ones. 


a) Suppose that E is a set. If there exists a function f from E onto N, 
then E is at most countable. 

b) A dyadic rational is a point x € R such that x = n/2” for some n € Z 
and m €N. The set of dyadic rationals is uncountable. 
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1.6.1. 
1.6.2. 


1.6.3. 


1.6.4. 
1.6.5. 


1.6.6. 


1.6.7. 


The Real Number System 


c) Suppose that A and B are sets and that f : A > Bis 1-1. If A is 
uncountable, then B is uncountable. 
d) If £,, Eo,... are finite sets, and 


E:= Ej x Ey x +++ t= {(41, X2,...) xj © Ej for all j e N}, 


then E is countable. 


Prove that the set of odd integers {1, 3, ...} is countable. 

Prove that set of rational lattice points in space—that is, the set Q° := 
{(x, y,Z) : x, y, z € Q}—is countable. 

Suppose that A and B are sets and that B is uncountable. If there exists a 
function which takes A onto B, prove that A is uncountable. 

Suppose that A is finite and f is 1-1 from A onto B. Prove that B is finite. 
Let f: A— Band g: B > C anddefine go f: A> C by (go f)(x) := 
g(f(x)). 


a) Show that if f g are 1-1 (respectively, onto), then g o f is 1-1 (respec- 
tively, onto). 

b) Prove that if f is 1-1 from A into B and Bo := {y : y = f(x) for some 
x € A}, then f—! is 1-1 from Bo onto A. 

c) Suppose that g is 1-1 from B onto C. Prove that f is 1-1 on A (respec- 
tively, onto B) if and only if g o f is 1-1 on A (respectively, onto C). 


Suppose that n € Nand @: {1,2,...,n}— {1,2,..., 7}. 


a) Prove that ¢ is 1-1 if and only if ¢ is onto. 
b) [PIGEONHOLE PRINCIPLE] Suppose that E is a finite set and that f : E > 
E. Prove that f is 1-1 on E if and only if f takes E onto E. 


A number x € R is called algebraic of degree n if it is the root of a poly- 
nomial P(x) = adyx” + +--+ a,x + ao, where aj € Z, a, # 0, and n is 
minimal. A number xo that is not algebraic is called transcendental. 


a) Prove that ifn ¢ Nand gq € Q, then 74 is algebraic. 

b) Prove that for each n € N the collection of algebraic numbers of 
degree n is countable. 

c) Prove that the collection of transcendental numbers is uncountable. 
(Two famous transcendental numbers are z and e. For more informa- 
tion on transcendental numbers and their history, see Kline [5].) 


CHAPTER 2 


Sequences in R 


2.1 


LIMITS OF SEQUENCES 


An infinite sequence (more briefly, a sequence) is a function whose do- 
main is N. A sequence f whose terms are xn := f(n) will be denoted by 
X1,X2,... OF {Xp}nen OF {Xn}P°,, Or {xy}. Thus 1, 1/2, 1/4, 1/8, ... represents the 
sequence {1/2”~'}, <j; —1, 1, —1, 1, ... represents the sequence {(—1)"},cn; and 
1,2,3,4,... represents the sequence {n}nen. 

It is important not to confuse a sequence {x,},cn with the set {x, :n € N}; 
these are two entirely different concepts. For example, as sequences, 
1,2,3,4,... is different from 2, 1,3,4,..., but as sets, {1,2,3,4,...} is identi- 
cal with {2, 1,3,4,...}. Again, the sequence 1, —1,1, —1,... is infinite, but the 
set {(—1)” : n € N} has only two points. 

The limit concept is one of the fundamental building blocks of analysis. Recall 
from elementary calculus that a sequence of real numbers {x,,} converges to a 
number a if x, gets near a (i.e., the distance between a and x, gets small) as n 
gets large. Thus, given ¢ > 0 (no matter how small), ifn is large enough, |x» —a| 
is smaller than e. This leads us to a formal definition of the limit of a sequence. 


2.1 Definition. 


A sequence of real numbers {x,} is said to converge to a real number a € Rif 
and only if for every ¢ > 0 there is an N € N (which in general depends on ¢) 
such that 


n>WN implies |x, —al| <e. 


We shall use the following phrases and notation interchangeably: 


a) {x,} converges to a; b) x, converges to a; c) a = limy—+oo Xn; d) X, > a as 
n — oo; e) the limit of {x,} exists and equals a. 

When x, — a asn — ov, you can think of x, as a sequence of approximations 
to a, and e as an upper bound for the error of these approximations. The number 
N in Definition 2.1 is chosen so that the error is less than e whenn > N. In 
general, the smaller ¢ gets, the larger N must be. (See, for example, Figure 2.1.) 

Notice by definition that x, converges to a if and only if |x, — a| — 0 as 
n — oo. In particular, x, — 0 if and only if |x,| ~ Oasn > ~w. 

According to Definition 2.1, to prove that a particular limit exists, given an 
arbitrary ¢« > 0, no matter how small, we must describe how to choose an N such 
that n > N implies |x, — a| < e. In particular, ¢ is usually introduced BEFORE 
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FIGURE 2.1 


N is specified, and N often is defined to depend on «. Since |x, — a| < ¢ for all 
n > N,N CANNOT depend on n. 

Before we actually prove that some concrete limits exist, we introduce addi- 
tional terminology. Let P, be a property indexed by N. We shall say that Pp 
holds for large n if there is an N € N such that P,, is true for alln > N. Hencea 
loose summary of Definition 2.1 is that x, converges to a if and only if |x, —a| 
is small for large n. What we mean by this is that given any prescribed positive 
quantity ¢ (no matter how small), we can choose N large enough so that |x, —a| 
is less than ¢ for alln > N. 


2.2 EXAMPLE. 
i) Prove that 1/n > Oasn > oc. 


ii) If x, — 2, prove that (2x, + 1)/x, > 5/2 asn > oo. 


Proof. i) Let « > 0. Use the Archimedean Principle to choose N € N such 
that N > 1/e. By taking the reciprocal of this inequality, we see that n > N 
implies 1/n < 1/N < . Since 1/n are all positive, it follows that |1/n| < e for 
alln > N. 


Stratecy for ii): By definition, we must show that 


2n+tl 5 2-Xy 


Xn a Dye 


is small for large n. The numerator of this last fraction will be small for large n 
since x, — 2,aSn — oo. What about the denominator? Since x, — 2, x, will 
be greater than 1 for large n, so 2x, will be greater than 2 for large n. Since 
we made n large twice, we will make two restrictions to determine the N that 
corresponds to ¢ in Definition 2.1. Let’s try to write all this down carefully to 
be sure that it works out. 

ii) Let e > 0. Since x, — 2, apply Definition 2.1 to this e > 0 to choose 
N € Nsuch that n > N; implies |x, — 2| < ¢. Next, apply Definition 2.1 with 
€ = 1 tochoose N2 such that n > N> implies |x, —2| < 1. By the Fundamental 
Theorem of Absolute Values, we have n > N2 implies x, > 1 (i.e., 2x, > 2). 

Set N = max{N1, N2} and suppose that n > N. Since n > Nj, we have 
|2 — xn| = |x) — 2| < e. Since n > No, we have 0 < 1/(2x,) < 1/2 < 1. It 
follows that 


= & 


2x%,+1 5 |2 — x,| E 
— < 
Xn 2 2Xn 2X4 


for alln > N. | 
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Notice that in the proof of Remark 2.1 we forced two properties that held for 
n>Nj,j =1,2,toholdforn > N by setting N equal to the maximum of N; and 
Np. It is clear that by this same process, if Nj,..., Ng have been chosen so that 
for each j a property P; holds when n > N; and if N = max{N,..., Ng}, then 
all g properties P},..., Pg hold simultaneously when n > N. We shall use this 
device frequently below, but rarely write N explicitly as a maximum of integers 
Nj again. 

The following two results show that a given sequence can have no limits or 
one limit, but no more. 


2.3 EXAMPLE. 


The sequence {(—1)"}nen has no limit. 


Proof. Suppose that (—1)” > aasn — oo for somea € R. Given e = 1, 
there is an N € N such that n > N implies |(—1)" — a| < ¢. For n odd this 
implies |1 + a| = |—1-—al| < 1, and forn even this implies |1 — a| < 1. Hence, 


2=)|Per | S| alee Slap l = 2; 
that is, 2 < 2, a contradiction. | 
2.4 Remark. A sequence can have at most one limit. 


Proof. Suppose that {x,} converges to both a and b. By definition, given 
€ > 0, there is an integer N such that n > N implies |x, — a| < ¢/2 and 
[Xn — b| < €/2. Thus it follows from the triangle inequality that 


la — b| < |a —Xy| + [xn — B| <8; 
that is, |a — b| < e for all e > 0. We conclude, by Theorem 1.9,thata =b. Hf 


We shall use the following concept many times. 


2.5 Definition. 


By a subsequence of a sequence {x,}nen, we Shall mean a sequence of the form 
{Xn, }ken, Where each ny ¢ Nandn, < nz <---. 


Thus a subsequence xn,,%n),... Of x1, x2,... is obtained by “deleting” from 
X1,x2,... all x,’s except those such that n = nz for some k. For example, 1, 1,... 
is a subsequence of (—1)” obtained by deleting every other term (set nz = 2k), 
and 1/2, 1/4, ...is asubsequence of 1/n obtained by deleting all nondyadic frac- 
tions; that is, deleting 1/3, 1/5, 1/6, 1/7, ... (set nx = 2*). 
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Subsequences are sometimes used to correct a sequence that behaves badly or 
to speed up convergence of another that converges slowly. For example, {1/n} 
converges much more slowly to zero than its subsequence {1/2”}, and {(—1)”} 
does not converge at all (see Example 2.3 above), but its subsequence 1, 1,... 
converges to 1 immediately. 

If x, — aasn — oo, then the x,’s get near a as n gets large. Since nx gets 
large as k does, it comes as no surprise that any subsequence of a convergent 
sequence also converges. 


2.6 Remark. Jf {xn}nen converges to a and {xXn,}ken iS any subsequence of 
{Xn }nen, then xn, converges to aas k — ov. 


Proof. Let « > 0 and choose N € N such that n > N implies |x, — a| < e. 
Since nz € Nand ny < n2 < ---, it is easy to see by induction that nz > k for 
allk € N. Hence, k > N implies |xn, —a| < ¢; thatis,x,,-aask->o. 


The following concepts also play an important role for the theory of 
sequences. 


2.7 Definition. 
Let {x,} be a sequence of real numbers. 


i) The sequence {x,} is said to be bounded above if and only if the set {x, : 
n € N} is bounded above. 

ii) The sequence {x,} is said to be bounded below if and only if the set {x, : 
n € N} is bounded below. 

iii) {x,} is said to be bounded if and only if it is bounded both above and below. 


Combining Definitions 2.7 and 1.10, we see that {x,} is bounded above 
(respectively, below) if and only if there is an M € R such that x, < M for 
all n € N (respectively, if and only if there is an m € R such that x, > m for all 
n €N). It is easy to check (see Exercise 2.1.4) that {x,} 1s bounded if and only if 
there is a C > 0 such that |x,| < C for all € N. In this case we shall say that 
{x,} is bounded, or dominated, by C. 

Is there a relationship between convergent sequences and bounded 
sequences? 


2.8 Theorem. Every convergent sequence is bounded. 


StraTeGy: The idea behind the proof is simple (see Figure 2.1). Suppose that 
Xn > aasn — oo. By definition, for large N the sequence xy, xy+1,... must 
be close to a, hence bounded. Since the finite sequence x,,...,xy_ , is also 
bounded, it should follow that the whole sequence is bounded. We now make 
this precise. 
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Proof. Given ¢ = 1, there isan N € N such that n > N implies |x, — a| < 1. 
Hence by the triangle inequality, |x,| < 1+ |a| for alln > N. On the other 
hand, if 1 <n < N, then 


lxn| < M := max{|x1|, |x2|,..., len}. 


Therefore, {x,} is dominated by max{M, | + |a}}. | 


Notice that by Example 2.3, the converse of Theorem 2.8 is false. 


EXERCISES 


2.1.0. 


2.1.1. 


2.1.2. 


2.1.3. 


2.1.4. 


2.1.5. 


Decide which of the following statements are true and which are false. 
Prove the true ones and provide a counterexample for the false ones. 


a) If x, converges, then x,/n also converges. 

b) If x, does not converge, then x,/n does not converge. 

c) If x, converges and y, is bounded, then x, y, converges. 

d) If x, converges to zero and y, > 0 for alln EN, then x,y, converges. 


Using the method of Example 2.2, prove that the following limits exist. 


a) 3+1/n > 3asn > ow. 

b) m-1/fn>nmasn>o. 

c) —2(1+n/n?) > —2asn > ov. 

d) (3n? — 1)/6n? > 1/2asn > ov. 

Suppose that x, is a sequence of real numbers that converges to 1 as 


n — oo. Using Definition 2.1, prove that each of the following limits 
exists. 


a) 3-—4x, > —lasn > o. 
b) Gxy — 1)/xn > 3-1 asn > ~w. 
c) (x7 +2)/xn > 3.asn > ov. 


For each of the following sequences, find two convergent subsequences 
that have different limits. 


a) 2(-1)" 

b) 2+(-1)” 

c) (n+2-—(-1)"n)/n 
Suppose that x, € R. 


a) Prove that {x,} is bounded if and only if there is a C > O such that 
|Ix,| < C foralln EN. 

b) Suppose that {x,} is bounded. Prove that x,/n* > 0, asn — ov, for 
allk EN. 


Let C be a fixed, positive constant. If {b,} is a sequence of nonnegative 
numbers that converges to 0, and {x,} is a real sequence that satisfies |x, — 
a| < Cb, for large n, prove that x, converges to a. 
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2.1.6. Let a be a fixed real number and define x, := a forn € N. Prove that the 
“constant” sequence x, converges. 
2.1.7. a) Suppose that {x,} and {y,} converge to the same real number. Prove 
that x) — y, > Oasn > oo. 
b) Prove that the sequence {n} does not converge. 
c) Show that there exist unbounded sequences x, #4 yn which satisfy the 
conclusion of part (a). 
2.1.8. Suppose that {x,} is a sequence in R. Prove that x, converges to a if and 
only if EVERY subsequence of x, also converges to a. 


2.2 LIMIT THEOREMS 


One of the biggest challenges we face (both for theory and applications) is decid- 
ing whether or not a given sequence converges. Once we know that it converges, 
we can often use other techniques to approximate or evaluate its limit. 

One way to identify convergent sequences is by comparing a sequence whose 
convergence is in doubt with another whose convergence property is already 
known (see Example 2.10). The following result is the first of many theorems 
that addresses this issue. 


2.9 Theorem. [SQUEEZE THEOREM]. 
Suppose that {xn}, {yn}, and {wy} are real sequences. 


i) If X, > aand y, > a (the SAME a) asn —> om, and if there is an No € N 
such that 


Xn < Wn <n forn> No, 


then Wy, > aasn —> o. 
ii) If xX, > Oasn — oo and {yp} is bounded, then xn yn, > Oasn > oo. 


Proof. i) Let ¢ > 0. Since x, and y, converge to a, use Definition 2.1 and 
Theorem 1.6 to choose Nj, Nz € N such that n > N; implies —e < x, —a <é 
and n > N> implies —e < y, —a < €. Set N = max{No, Nj, No}. Ifn > N, we 
have by hypothesis and the choice of N; and N2 that 


A-E<Xp SS Wn Sn < a+ 8€; 


that is, |w, — a| < e forn > N. We conclude that w, > aasn > o. 
ii) Suppose that x, — 0 and that there is an M > 0 such that |y,;| < M for 
n €N. Let e > 0 and choose an N € N such that n > N implies |x,| < e/M. 
Then n > N implies 
€ 
IXnYnl < Ma = €. 
We conclude that x,y, > 0asn > oo. | 


The following example shows how to use the Squeeze Theorem to find the 
limit of a complicated sequence by ignoring its “less important” factors. 
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2.10 EXAMPLE. 


Find lim,_.o5 27” cos(n? — n? +n — 13). 


Solution. The factor cos(n? —n* +n—13) looks intimidating, but it is superfluous 
for finding the limit of this sequence. Indeed, since |cosx| < 1 for all x € R, 
the sequence {2~” cos(n>? — n? +n — 13)} is dominated by 2-”. Since 2” > n, 
it is clear by Example 2.2i and the Squeeze Theorem that both 2~” — 0 and 
2-" cos(n? — n* +n — 13) > Oasn > ov. | 


The Squeeze Theorem can also be used to construct convergent sequences 
with certain properties. To illustrate how this works, we now establish a result 
that connects suprema and infima with convergent sequences. 


2.11 Theorem. Let E CR. /f E has a finite supremum (respectively, a finite 
infimum), then there is a sequence xn € E such that x, — sup E (respectively, a 
sequence yy € E such that y, — inf E) asn > ov. 


Proof. Suppose that E has a finite supremum. For each n € N, choose (by 
the Approximation Property for Suprema) an x, € E such that sup E —1/n < 
X, < sup E. Then by the Squeeze Theorem and Example 2.2i, x, — sup E as 
n — oo. Similarly, there is a sequence y, € FE such that y, — inf E. | 


Here is another result that helps to evaluate limits of specific sequences. This 
one works by viewing complicated sequences in terms of simpler components. 


2.12 Theorem. Suppose that {x,} and {y,} are real sequences and that a € R. If 
{xn} and {y,} are convergent, then 


i) lim (&) + yn) = lim x, + lim yy, 
n—->Oo n—>Co n—>Co 
ii) lim (a@x,) =a lim xp, 
n—->Co n—->Co 
and 
iti) lim (Gyn) = (lim x,)( lim yy). 
n—->Co n—>Co n—>Co 
If, in addition, yy, # 0 and limp-+oo yn & 0, then 
Xn limn— oo Xn 


iv) lim = . 
) nO Vn limp—oo Yn 


(In particular, all these limits exist.) 
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Proof. Suppose that x, > x and y, > y asn > ©. 


i) Let e > 0 and choose N € N such that n > N implies |x, — x| < e/2 and 
lyn — y| < €/2. Thus n > N implies 


E 
(Xn + Yn) — (& + y)| S len — X14 1 Yn Ms hee 


ii) It suffices to show that ax, — ax > Oasn —> oo. But x, —x > Oas 
n — oo, hence by the Squeeze Theorem, a(x, — x) > Oasn —> ov. 

iii) By Theorem 2.8, the sequence {x,} is bounded. Hence by the Squeeze 
Theorem the sequences {xn (yn — y)} and {(x, —x)y} both converge to 0. Since 


XnYn — XY =Xn(Vn — Y) + (Xn — X)Y, 


it follows from part i) that x,y, — xy asn — oo. A similar argument estab- 
lishes part iv) (see Exercise 2.2.4). a 


Theorem 2.12 can be used to evaluate limits of sums, products, and quotients. 
Here is a typical example. 


2.13 EXAMPLE. 
Find limp_.o0(n? +n? — 1)/(1 — 3n?). 


Solution. Multiplying the numerator and denominator by 1/n?, we find that 


m+n?-1  1+(/n)—(1/n°) 
1-3n3 (/n3)-3 


By Example 2.2i and Theorem 2.12iii, 1/n* = (1/n)* > 0, asn — ov, for any 
k EN. Thus by Theorem 2.12i, ii, and iv, 


lim = = ; a 


The sequence {log 7},<n fails to converge in a different way than {n(—1)"},en 
does. Indeed, the terms logn get steadily larger as n — ov, but the terms n(—1)” 
bounce back and forth between large positive values and large negative values. 
It is sometimes convenient to emphasize this difference by generalizing limits to 
include extended real numbers. 


2.14 Definition. 


Let {x,} be a sequence of real numbers. 


i) {xp} is said to diverge to +oo (notation: x, > +-oo asn — 00 or limy-+o0 Xn 
= +00) if and only if for each M € R there is an N € N such that 


n>WN implies x, > M. 
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ii) {x,} is said to diverge to —co (notation: x, — —oo asn > 00 or limy-+00 Xn 
= —oo) if and only if for each M € R there is an N € N such that 


n>WN implies x, < M. 


Notice by Definition 2.141 that x, — +00 if and only if given M ¢€ R, x, 
is greater than M for sufficiently large n; that is, eventually x, exceeds every 
number M (no matter how large and positive M is). Similarly, x, — —oo if 
and only if x, eventually is less than every number M (no matter how large and 
negative M is). 

It is easy to see that the Squeeze Theorem can be extended to infinite limits 
(see Exercise 2.2.7). The following is an extension of Theorem 2.12. 


2.15 Theorem. Suppose that {x,} and {y,} are real sequences such that x, —> 
+oo (respectively, x, — —0o) asn > oo. 


i) If y, is bounded below (respectively, yn is bounded above), then 
lim (%, + yn) = +00 (respectively, lim (%, + yn) = —Ov). 
noo noo 


ii) Ifa > 0, then 


lim (@x,) =+00 (respectively, lim (ax,) = —00). 
no n—>oo 


ii) If yn > Mo for some Mo > O and alln €N, then 
lim (X%nyn) = +00 (respectively, lim (xXnyn) = —00). 
now nw 


iv) If {yn} is bounded and xn # 0, then 


Proof. We suppose for simplicity that x, — +-oo asn > oo. 


i) By hypothesis, y, > Mo for some Mo € R. Let M € Rand set M; = M— Mo. 
Since x, — +00, choose N € N such that n > N implies x, > M,. Then 
n > N implies x, + y, > M, + Mp = M. 

ii) Let M € Rand set Mj; = M/a. Choose N € N such that n > N implies 
X, > M,. Since a > 0, we conclude that ax, > aM; = M foralln>N. 

iti) Let M € Rand set Mj = M/Mo. Choose N é€ N such that n > N implies 
Xn > M,. Thenn > N implies x,y, > M1 Mo = M. 

iv) Let ¢ > 0. Choose Mp > 0 such that |y,| < Mo and M, > 0 so large that 
Mo/M, < €. Choose N € N such that n > N implies x, > M,. Thenn > N 
implies 


Yn 
Xn 


lyn Mo 
= < <E 
Xn M, 
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If we adopt the conventions 


x+tCoO=O0, X-W=-O, x ER, 

X-CO=O, X-(-—Ww)=-N, x > 0, 

X-CO=-O, X-(-W)=N, x <0, 

Coto =MO, -WM-NW=-N, 

CO: 00 = (—oo):-(—0o0o) = oo, and o-(—o) =(—ow)-w=-wow, 


then Theorem 2.15 contains the following corollary. 


2.16 Corollary. Let {xn}, {yn} be real sequences and a, x, y be extended real num- 
bers. If x, — x and yn > y, asn — o, then 


lim (%n + yn) =x +y 
nw 
provided that the right side is not of the form co — oo, and 
lim (@x,)) = ax, lim (XnYn) = xy 
now nw 
provided that none of these products is of the form 0 - oo. 


We have avoided the cases co — oo and 0 - too because they are “inde- 
terminate.” For a discussion of indeterminate forms, see |’H6pital’s Rule in 
Section 4.4. 

Theorems 2.12 and 2.15 show how the limit sign interacts with the algebraic 
structure of R. (Namely, the limit of a sum (product, quotient) is the sum (prod- 
uct, quotient) of the limits.) The following theorem shows how the limit sign 
interacts with the order structure of R. 


2.17 Theorem. [COMPARISON THEOREM]. 
Suppose that {x,} and {y,} are convergent sequences. If there is an No € N such 
that 


Xn < yn forn > No, (1) 
then 


lim x, < lim yp. 
n—->OCo n—->Co 


In particular, if x, € [a,b] converges to some point c, then c must belong to 


[a, b]. 


Proof. Suppose that the first statement is false; that is, that (1) holds but x := 
limy—+oo Xn is greater than y := lim, +9 yy. Set e = (x—y)/2. Choose Nj > No 
such that |x, — x| < ¢ and |y, — y| < ¢ forn > N,. Then for such an n, 


x—y My 
Xn > X-E=X ( 5) )a=v4 (2) or tee 
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which contradicts (1). This proves the first statement. 
We conclude by noting that the second statement follows from the first, 
since a < x, < bimpliesa <c <b. | 


One way to remember this result is that it says the limit of an inequality is 
the inequality of the limits, provided these limits exist. We shall call this process 
“taking the limit of an inequality.” Since x, < y, implies x, < y,, the Compar- 
ison Theorem contains the following corollary: If {x,} and {y,} are convergent 
real sequences, then 


Xn<Yn, n>No, imply lim x, < lim yy. 
now nwo 
It is important to notice that this result is false if < is replaced by <; that is, 


Xn <Yn, n>WNo, doesNOTimplythat lim x, < lim y. 
n—>Co n—>Co 


For example, 1/n* < 1/n, but the limits of these sequences are equal. 


EXERCISES 


2.2.0. Determine which of the following statements are true and which are 
false. Prove the true ones and provide counterexamples for the false 
ones. 


a) If x, > co and y, > —oo, then x, + y, > Oasn > oc. 
b) Ifx, > —ov, then 1/x, > 0asn > oo. 

c) If x, > 0, then 1/x, > co asn > oo. 

d) If x, — ov, then (1/2)*" > Oasn > oo. 


2.2.1. Prove that each of the following sequences converges to zero. 


a) xX, = cos (logn + 5n> + 3)/n 
b) xn = (2n? + 1)/(3n* + 1) 
C) = Vn+ T/(n + V3) 
d) x, = 2n/3" 
2.2.2. Use Definition 2.14 to prove that each of the following sequences 
diverges to +00 or to —oo. 
a) xX, =n? —n 
b) xn =n —3n? 
nti 


c) Xn = 
n 
d) x, =n?(2+sin(n? +n+1)) 
2.2.3. Find the limit (if it exists) of each of the following sequences. 


a) X, = (5n? + 3n — 1)/(—6n? + 4n — 2) 
b) Xn = (n* —1)/(n* —n? — 1) 
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C) penta /n 

d) Xn = (V25n +1 — J/n)/(/4n — 1+ Vn — 2) 
2.2.4. a) Prove Theorem 2.12iv. 

b) Prove Corollary 2.16. 


2.2.5. Suppose that x € R, x, > 0, and x, — x asn — oo. Prove that 
JxXn —> J/x asn — ov. [For the case x = 0, use inequality (8) in 
Section 1.2.] 

2.2.6. Prove that given x € R there is a sequence r, € Q such that r, — x as 
n> o. 

2.2.7. Suppose that x and y are extended real numbers and that {x,}, {yn}, and 
{w,} are real sequences. 


a) [SQUEEZE THEOREM FOR R]. If x, — x and y, > x, asn — oo, and 
Xn < Wn < yn forn EN, prove that wy, > x asn > oo. 

b) [Comparison THEOREM FoR R]. If x, > x and y, > y, asn > o, 
and x, < y, forn EN, prove that x < y. 


2.2.8. Using the result in Exercise 2.2.5, prove the following results. 


a) Suppose that 0 < x; < landx,4; =1—J/1—x, forn EN. Ifx, > x 
asn —> oo, then x =Oor 1. 

b) Suppose that x; > 3 and x,4; =24+ /x, —2forn EN. If x, > x as 
n — oo, then x = 3. 

(c) Suppose that x1 > 0 and x4; = /2+%x, forn € N. If x, > x as 
n — oo, then x = 2. What happens if x; > —2? 


2.2.9|. This exercise was used in Section 1.6. 


a) Suppose that 0 < y < 1/10” for some integer n > 0. Prove that there 
is an integer 0 < w < 9 such that 


Ja w 1 
Tortt =) < Toret TF [get 


b) Prove that given x € [0, 1) there exist integers 0 < x, < 9 such that 


for alln EN, 
2 Xk t Xk 1 
SS eee te 
k — k 
= 10 rae 10 10” 
c) Prove that given x € [0, 1) there exist integers 0 < x, <9, k EN, 
such that 


n 
; Xk 
ioe 
k=1 
d) Using part c), prove that 0.5 = 0.4999... and 1 = 0.999... 


NOTE: The numbers x, are called digits of x, and 0.x;x2...is called a decimal 
expansion of x. Unless x is a rational number whose denominator is of the form 
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2'5/ for some integers i > 0, j > 0, this expansion is unique; that is, there is 
only one sequence of integers {x;} that satisfies part (c). On the other hand, if 
x is a rational number whose denominator is of the form 2'5/, then there are 
two sequences {x,} that satisfy part (c), one that satisfies x, = 0 for large k and 
one that satisfies x, = 9 for large k (see part d). We shall identify the second 
sequence by saying that it terminates in 9s. 


2.3. BOLZANO-WEIERSTRASS THEOREM 


Notice that although the sequence {(—1)”} does not converge, it has conver- 
gent subsequences. In this section we shall prove that this is a general principle. 
Namely, we shall establish the Bolzano—Weierstrass Theorem, which states that 
every bounded sequence has a convergent subsequence. 

We begin with a special case (monotone sequences) for which the Bolzano- 
Weierstrass Theorem is especially transparent. Afterward, we shall use this spe- 
cial case to obtain the general result. 


2.18 Definition. 
Let {xn }nen be a sequence of real numbers. 


i) {xp} is said to be increasing (eapecHively, strictly increasing) if and only if 
xX} <x2 <--- (respectively, xj < x2 <---). 
ii) {x,} is said to be decreasing freapectively, strictly decreasing) if and only if 
xX; > x2 >--- (respectively, x1 > x2 >---). 
iti) {x} is said to be monotone if and only if it is either increasing or 
decreasing. 


(Some authors call decreasing sequences nonincreasing and increasing 
sequences nondecreasing.) 


If {x,} is increasing (respectively, decreasing) and converges to a, we shall 
write x, t a (respectively, x, | a), asn — oo. Clearly, every strictly increas- 
ing sequence is increasing, and every strictly decreasing sequence is decreasing. 
Also, {xn} is increasing if and only if the sequence {—x,} is decreasing. 

By Theorem 2.8, any convergent sequence is bounded. We now establish the 
converse of this result for monotone sequences. (For an extension to extended 
real numbers, see Exercise 2.3.6.) 


2.19 Theorem. [MONOTONE CONVERGENCE THEOREM]. 
If {xn} is increasing and bounded above, or if {xn} is decreasing and bounded 
below, then {x,} converges to a finite limit. 


Proof. Suppose first that {x,} is increasing and bounded above. By the Com- 
pleteness Axiom, the supremum a := sup{x, : n € N} exists and is finite. Let 
€ > 0. By the Approximation Property for Suprema, choose N € N such that 


a-—-E<Xn <a. 
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Since xy < x, forn > N and x, <a foralln €N, it follows thata—e < x, <a 
for alln > N. In particular, x, + a asn —> oo. 

If {x,} is decreasing with infimum b := inf{x, : n € N}, then {—x,} is 
increasing with supremum —b (see Theorem 1.20). Hence, by the first case 
and Theorem 2.1211, 


b=—(—b)= jim ( Xn) = (dim Xn. a 


The Monotone Convergence Theorem is used most often to show that a limit 


exists. Once existence has been established, it is often easy to find the value 
of that limit by using Theorems 2.9 and 2.12. The following examples illustrate 
this fact. 


2.20 EXAMPLE. 


If 


la| < 1, thena” > Oasn > ow. 


Proof. It suffices to prove that |a|" > 0 asn — oo. First, we notice that |a|” 
is monotone decreasing since by the Multiplicative Property, |a| < 1 implies 
|a|"+! < |a|" for all n € N. Next, we observe that |a|” is bounded below (by 
0). Hence by the Monotone Convergence Theorem, L := limy- 0 |a|” exists. 

Take the limit of the algebraic identity |a|"t! = |a| - |a|", asn — oo. By 
Remark 2.6 and Theorem 2.12, we obtain L = |a|-L. Thus either L = 0 or 
|a| = 1. Since |a| < 1 by hypothesis, we conclude that L = 0. a 


2.21 EXAMPLE. 


If 


a > 0,thena!/” + lasn > ow. 


Proof. We consider three cases. 

Case 1. a = 1. Thena!/” = 1 for alln EN, and it follows that a!/” > 1 as 
n> oO. 

Case 2. a > 1. We shall apply the Monotone Convergence Theorem. To 
show that {a!/”} is decreasing, fix n € N and notice that a > 1 implies a"t! > 
a”. Taking the n(n + 1)st root of this inequality, we obtain al? sa gilGrh, 
that is, a!/” is decreasing. Since a > 1 implies a!/” > 1, it follows that a!/” 
is decreasing and bounded below. Hence, by the Monotone Convergence 
Theorem, L := lim,+..a!/" exists. To find its value, take the limit of the 
identity (a!/@”)2 = q!/" asn — oo. We obtain L? = L; that is, L = 0 or 1. 
Since a!/” > 1, the Comparison Theorem shows that L > 1. Hence L = 1. 

Case 3.0 <a < 1. Then 1/a > 1. It follows from Theorem 2.12 and Case 
2 that 

lim a/" = lim = : 1 Z 
n> oo n—>0o 1/al/n limyoo(1/a)!/* . 


Next, we introduce a monotone property for sequences of sets. 
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2.22 Definition. 


A sequence of sets {In}nen is Said to be nested if and only if 


Hobs, 


In Chapters 3, 8, and 9, we shall use this concept to study continuous func- 
tions. Here, we use it to establish the Bolzano—Weierstrass Theorem. All of 
these applications depend in a fundamental way on the following result. 


2.23 Theorem. [NESTED INTERVAL PROPERTY]. 

Tf Un}nen is anested sequence of nonempty closed bounded intervals, then E := 
1 tn is nonempty. Moreover, if the lengths of these intervals satisfy |I,| > 0 
as n — ©, then E is a single point. 


Proof. Let I, = [an, by]. Since {7,} is nested, the real sequence {a,} is increas- 
ing and bounded above by b;, and {b,,} is decreasing and bounded below by a, 
(see Figure 2.2). Thus by Theorem 2.19, there exist a,b € R such that an, + a 
and b, | basn — oo. Since a, < by, for alln € N, it also follows from the 
Comparison Theorem that a, < a < b < by. Hence, a number x belongs to 
I, for alln € N if and only if a < x < b; that is, if and only if x € [a,b]. In 
particular, any x € [a, b] belongs to all the [,,’s. 


a ee | 

[ F SS 1 | 

ay ay a3, ay...a b... by by by by 
FIGURE 2.2 


We have proved that there is exactly one number that belongs to all the /,,’s 
if and only if a = b. But if |J,| > 0asn > ov, then b, — a, > Oasn > ow. 
Hence, by Theorem 2.12, a does equal b when |/,| — 0asn > ow. | 


The next two results show that neither of the hypotheses of Theorem 2.23 can 
be relaxed. 


2.24 Remark. The Nested Interval Property might not hold if “closed” is 
omitted. 


Proof. The intervals [, = (0,1/n), n € N, are bounded and nested but not 
closed. If there were an x € /, for alln € N, then 0 < x < 1/n; thatis,n < 1/x 
for alln € N. Since this contradicts the Archimedean Principle, it follows that 
the intervals /,, have no point in common. | 


2.25 Remark. The Nested Interval Property might not hold if “bounded” is 
omitted. 
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Proof. The intervals J, = [n,co), n € N are closed and nested but not 
bounded. Again, they have no point in common. a 


We are now prepared to establish the main result of this section. 


2.26 Theorem. [BOLZANO-WEIERSTRASS THEOREM]. 
Every bounded sequence of real numbers has a convergent subsequence. 


Proof. We begin with a general observation. Let {x,} be any sequence. If 
E = AUB are sets and E contains x, for infinitely many values of n, then at 
least one of the sets A or B also contains x, for infinitely many values of n. (If 
not, then EF contains x, for only finitely many n, a contradiction.) 

Let {x,} be a bounded sequence. Choose a, b € R such that x, € [a, b] for 
alln € N, and set Jp = [a, b]. Divide Jp into two halves, say I’ = [a, (a + b)/2] 
and I” = [(a + b)/2, b]. Since Ip = I’ UI", at least one of these half-intervals 
contains x, for infinitely many n. Call it 1;, and choose n; > 1 such that 
Xn, € 11. Notice that |1)| = |Jo|/2 = (b — a)/2. 

Suppose that closed intervals Jj D I; D ... D J and natural numbers 
nt <Nnz <... <p» have been chosen such that for each 0 < k < m, 


—a 


b 
el = 


Xn, € le, and xy € J for infinitely many n. (2) 


To choose I41, divide Im = [am, bm] into two halves, say I’ = [am, (am + 
bm)/2] and I” = [(an+bm)/2, bm]. Since Im = I'UI", at least one of these half- 
intervals contains x, for infinitely many n. Call it 1,41, and choose ny,41 > Mm 
such that xn,,., € Im41. Since 


[ml ba 
mil = > = Saar 
it follows by induction that there is a nested sequence {Jx},cen of nonempty 
closed bounded intervals that satisfy (2) for all k € N. 
By the Nested Interval Property, there is an x € R that belongs to /; for all 
k EN. Since x € Ig, we have by (2) that 


—a 
O< |xn, —x| < lel 5: 

for all k € N. Hence by the Squeeze Theorem, x», — x ask > oo. gz 
EXERCISES 


2.3.0. Decide which of the following statements are true and which are 
false. Prove the true ones and provide counterexamples for the false 
ones. 


a) If x, is strictly decreasing and 0 < x, < 1/2, then x, > Oasn > oo. 


2.3.1 


2.3.2. 


2.3.3. 


2.3.4. 


2.3.5. 


2.3.7 


2.3.8. 


Section 2.3 Bolzano-Weierstrass Theorem 71 
b) If 


_ @—1)cos(n? +n +1) 
= 2n—1 ; 


n 


then x, has a convergence subsequence. 
c) If x, is a strictly increasing sequence and 


1 
lxn| < 1+ — 
n 


forn =1,2,...,thenx, > lasn > co. 
d) If x, has a convergent subsequence, then x, is bounded. 


Suppose that xo € (0, 1) and x, = 2./x,_1/4 for n € N. Prove that x, + 1 
as n —> oo. What happens when xo € [0, 1]? 

Suppose that 0 < x1 < 1 and x,4; = 1— /l—x, forn € N. Prove 
that x, | Oasn > oo and x41/%, > 1/2, asn > oo. (Exercise 4.3 in 
Apostol [1].) 

Suppose that x9 > 1 and x, = /x,-1 —14+ 1 forn € N. Use the Mono- 
tone Convergence Theorem to prove that either x, — 1 or x, > 2 as 
n—-> Oo. 

Suppose that x9 € Rand x, = (4x%,-1+1)/5 forn € N. Use the Monotone 
Convergence Theorem to prove that x, > lasn > oo. 


Prove that 
1 x>0O 
lim x!/@7-) = 26 x=0 
n—>Co 
—l x <0. 


. This result is used in Section 6.3 and elsewhere. 


a) Suppose that {x,} is a monotone increasing sequence in R (not neces- 
sarily bounded above). Prove that there is an extended real number 
x such that x, — x aSn —> oo. 

b) State and prove an analogous result for decreasing sequences. 


Suppose that E Cc R is a nonempty bounded set and that supE ¢ E. 
Prove that there exists a strictly increasing sequence {x,} that converges 
to sup E such that x, € E foralln EN. 

Let 0 < y; < x, and set 


Xn + 
— and = yn41 = SXnYn> neN. 


Xn+1 = 


a) Prove that 0 < y, < x, foralln EN. 

b) Prove that y, is increasing and bounded above, and that x, is decreas- 
ing and bounded below. 

c) Prove that 0 < x41 — yas < (41 — y1)/2” forn EN. 
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d) Prove that limy+o%, = limp—soo yn. (This common value is called 
the arithmetic-geometric mean of x; and y.) 


2.3.9. Suppose that x9 = 1, yo = 0, 
Xn =Xn-1+2yn-1, and yn = Xn-1 + Yn-1 


forn €N. Prove that x? — 2y? = +1 forn ¢ Nand 


Xn 


Yn 


>v2 ano. 


2.3.10. [ARCHIMEDES] Suppose that x9 = 2V3, yo = 3, 


= 2Xn—1Yn—1 
‘ed ce ee 
Xn—-1 + Yn-1 


and 


Yn = VXnYn-1 


forn EN. 
a) Prove that x, | x and y, + y,asn — oo, forsome x,y ER. 
b) Prove that x = y and 


3.14155 < x < 3.14161. 
(The actual value of x is zr.) 


2.4 CAUCHY SEQUENCES 


In this section we introduce an extremely powerful and widely used concept. 

By definition, if {x,} is a convergent sequence, then there is a point a € Rsuch 
that x, is near a for large n. If the x,’s are near a, they are certainly near each 
other. This leads us to the following concept. 


2.27 Definition. 


A sequence of points x, € R is said to be Cauchy (in R) if and only if for every 
€ > O there is an N € N such that 


nsm>N imply |x, —%m| <. (3) 


The next two results show how this concept is related to convergence. 


2.28 Remark. /f {x,} is convergent, then {xy} is Cauchy. 
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Proof. Suppose that x, — aasn — oo. Then by definition, given ¢ > 0 there 
is an N € N such that |x, — a| < ¢/2 for alln > N. Hence ifn,m > N, it 
follows from the triangle inequality that 


Xn — Xm| S [Xn — al + [Xm og Se a 


The following result shows that the converse of Remark 2.28 is also true (for 
real sequences). 


2.29 Theorem. [CAUCHY]. 
Let {x,} be a sequence of real numbers. Then {xy} is Cauchy if and only if {xy} 
converges (to some point a in R). 


STRATEGY: By Remark 2.28 we need only show that every Cauchy sequence 
converges. Suppose that {x,} is Cauchy. Since the x,’s are near each other, the 
sequence {x,} should be bounded. Hence, by the Bolzano—Weierstrass Theo- 
rem, {xn} has a convergent subsequence, say x,,. This means that for large k, 
the x,,’s are near some point a € R. But since {x,} is Cauchy, the x,’s should 
be near the x,,’s for large n, hence also near a. Thus the full sequence should 
converge to that same point a. Here are the details. 


Proof. Suppose that {x,} is Cauchy. Given e = 1, choose N é€ N such that 
[xv — Xm| < 1 for allm > N. By the triangle inequality, 


lXm| < 1+ |xn| form > N. 


Therefore, {x,} is bounded by M = max{|xj|, |x2|,..., [xy—1], 1 + |xn]}. 
By the Bolzano—Weierstrass Theorem, {x,} has a convergent subsequence, 
Say Xn, > aask — oo. Lete > 0. Since x, is Cauchy, choose N; € N such that 


; € 
n,m>N, imply tn — Xml < 5. 


Since xn, — a ask — oo, choose N2 € N such that 


€ 
k>N2 implies |xn, —a| < 5° 


Fix k > No such that ny > N;. Then 
Xn — al < [Xn — Xn, | + len, — al < € 
for alln > Ny. Thus x, > aasn > oo. a 


This result is extremely useful because it is often easier to show that a 
sequence is Cauchy than to show that it converges. The reason for this, as the 
following example shows, is that we can prove that a sequence is Cauchy even 
when we have no idea what its limit is. 
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2.30 EXAMPLE. 


Prove that any real sequence {x,,} that satisfies 


1 
tn — Xntil S sy neéN, 
is convergent. 
Proof. Ifm > n, then 
Xn — Xml = en — Xn41 + Xnt1 — Xn42 + +++ + Xm—-1 — Xml 
S [Xn — Xn41] + lXnt1 — Xng2| + +++ + m-1 — Xml 
1 1 


1 
wee! =a): 


(The last step uses Exercise 1.4.4c, for a = 2.) It follows that |x, — xm| < 
1/2”! for all integers m > n > 1. But given ¢ > 0, we can choose N € N so 
large that n > N implies 1/2"~! < e. We have proved that {x,} is Cauchy. ge 
Theorem 2.29, therefore, it converges to some real number. 


The following result shows that a sequence is not necessarily Cauchy just 
because x, is near x,+1 for large n. 


2.31 Remark. A sequence that satisfies xn+41 — xn — 0 is not necessarily Cauchy. 


Proof. Consider the sequence x, := logn. By basic properties of logarithms 
(see Exercise 5.3.7), 


Xn+1 — Xp» = log(n + 1) — logn = log((n+ 1)/n) > log1 =0 


asin — oo. {x,} cannot be Cauchy, however, because it does not converge; in 
fact, it diverges to +00 asn > oo. 


EXERCISES 


2.4.0. Decide which of the following statements are true and which are false. 
Prove the true ones and provide a counterexample for the false ones. 


a) If {x,} is Cauchy and {y,} is bounded, then {xy y,} is Cauchy. 

b) If {x,} and {y,} are Cauchy and y, 4 0 for alln € N, then {x,/yp} is 
Cauchy. 

c) If {x,} and {y,} are Cauchy and x, + y, > 0 for alln EN, then {1/(x,) + 
Yn) cannot converge to zero. 
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d) If {x,} is a sequence of real numbers that satisfies x3 — x>x-1 > O as 
k > ooandif x, = 0 for alln 4 2*, k EN, then {x,} is Cauchy. 


2.4.1. Prove that if {x,,} is a sequence that satisfies 
Len 5n?* + 3n+7 
ate aA +4n?—1 


for alln € N, then {x,,} is Cauchy. 
2.4.2 


e 


Suppose that x, € Zforn € N. If {x,} is Cauchy, prove that x, is eventu- 


ally constant; that is, that there exist numbers a € Z and N &€ N such that 


x, =aforalln>N. 


2.4.3 


. 


a) Without using Theorem 2.29, prove that ax, is Cauchy. 
b) Without using Theorem 2.29, prove that x, + yy, is Cauchy. 
c) Without using Theorem 2.29, prove that x» y, is Cauchy. 


Suppose that x, and y, are Cauchy sequences in R and thata € R. 


2.4.4, Let {x,} be a sequence of real numbers. Suppose that for each ¢ > 0 there 
isan N € N such that m > n > N implies baa xx < e. Prove that 


n 
lim ae 
noo 
k=1 


exists and is finite. 
2.4.5. Prove that limy—oo )-7—1(— 1)‘/k exists and is finite. 


2.4.6. Let {x,} be a sequence. Suppose that there is ana € (0, 1) such that 


IXn41 —Xn| X a” 


for alln € N. Prove that x, — x for some x € R. 


2.4.7. a) Let E be a subset of R. A point a € Ris called a cluster point of E if 
EN (a—r,a-+r) contains infinitely many points for every r > 0. Prove 
that a is a cluster point of E if and only if for eachr > 0, EN (a — 


r,a+r)\{a} is nonempty. 


b) Prove that every bounded infinite subset of R has at least one clus- 


ter point. 


2.4.8 a) A subset E of R is said to be sequentially compact if and only if every 
sequence x, € E has a convergent subsequence whose limit belongs to 

E. Prove that every closed bounded interval is sequentially compact. 
b) Prove that there exist bounded intervals in R that are not sequentially 


compact. 


c) Prove that there exist closed intervals in R that are not sequentially 


compact. 


*2.5 LIMITS SUPREMUM AND INFIMUM 


This section uses no material from any other enrichment section. 
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In some situations (e.g., the Root Test in Section 6.3), we shall use the follow- 
ing generalization of limits. 


2.32 Definition. 


Let {x,} be a real sequence. Then the limit supremum of {x,} is the extended 
real number 


lim sup x, := lim (sup xx), (4) 
H=SOS NFO psy 


and the limit infimum of {x,} is the extended real number 


lim inf x, := lim (inf x,). 
noo n>oo k>n 


Before we proceed, we must show that the limits in Definition 2.32 exist as 
extended real numbers. To this end, let {x,} be a sequence of real numbers and 
consider the sequences 


Sy = supxz = sup{x,:k >n} and $f, = inf xp := inf{x, 2k > nh}. 
>n k>n 


Each s, and t, is an extended real number, and by the Monotone Property, sy is 
a decreasing sequence and f, an increasing sequence of extended real numbers. 
In particular, there exist extended real numbers s and ¢ such that s, | s and 
t, + t asSn — co (see Exercise 2.3.6). These extended real numbers are, by 
Definition 2.32, the limit infimum and limit supremum of the sequence {x,}. 

Here are two examples of how to compute limits supremum and limits 
infimum. 


2.33 EXAMPLE. 


Find lim sup, _,.o %n and lim infy_, 90 Xp, if x, = (-1)”. 


Solution. Since supp» (—1)* = | for alln EN, it follows from Definition 2.32 
that lim sup,,_, 45% = 1. Similarly, lim inf,_, 9 x, = —I. a 
2.34 EXAMPLE. 

Find lim sup,,_, 45 Xn and lim infy_, 99 X%, if x, = 14+ 1/n. 


Solution. Since sup,.,(1 + 1/k) = 14+ 1/n for alln €N, lim sup, _,.5 41 = 1. 
Since infys,(1 + 1/k) = 1 for alln EN, lim infy_,o0 X% = 1. a 


These examples suggest that there is a connection among limits supremum, 
limits infimum, and convergent subsequences. The next several results make 
this connection clear. 
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2.35 Theorem. Let {x,} be a sequence of real numbers, s = lim sup, _, 4 Xn, and 
t = lim infy—o0 Xn. Then there are subsequences {Xn,}ken and {xe;}jen such that 
Xn, > Sask > oo and x¢; >tasjroo. 


Proof. We will prove the result for the limit supremum. A similar argument 
establishes the result for the limit infimum. Let s, = sup;.,, x, and observe 
that s, | s asn —> oo. 7 

Case 1. s = oo. Then by definition s, = oo for alln € N. Since s; = ov, there 
isann, € Nsuch that x,, > 1. Since s,,41 = ov, there isann2 >nj +1>n1 
such that x,, > 2. Continuing in this manner, we can choose a subsequence 
{Xn,} Such that x,, > k for allk ¢ N. Hence, it follows from the Squeeze 
Theorem for R (see Exercise 2.2.7) that xn, > 00 = s ask > ov. 

Case 2. s = —ow. Since sy, > x, for alln € Nand s, ~ —oasn > ow, it 
follows from the Squeeze Theorem for R that x, ~ —oo =s asn > ov. 

Case 3. —co < 5 < oo. Set no = 0. By Theorem 1.14 (the Approximation 
Property for Suprema), there is an integer n; € N such that s,)41 — 1 < x», < 
Sno+1- Similarly, there is an integer n2 > nj + 1 > mn; such that sy,41 — 1/2 < 
Xn. < Sn,41. Continuing in this manner, we can choose integers n1 < n2 <--- 
such that 


1 
Snjpj+1 — i <Xny < Sny_ 441 (5) 


fork €N. Since sy, ,4+1 — s ask — oo, we conclude by the Squeeze Theorem 
that xn, > s ask —> oo. a 


This observation leads directly to a characterization of limits in terms of limits 
infimum and limits supremum. 


2.36 Theorem. Let {x,} be a real sequence and x be an extended real number. 
Then x, > x asn — ooif and only if 


lim sup x, = liminf x, = x. (6) 
noo re oe 


Proof. Suppose that x, — x asn — oo. Then x,, —- x ask — oo for 
all subsequences {x,,}. Hence, by Theorem 2.35, limsup,_,.,%n = x and 
lim inf, Xn = x; that is, (6) holds. 

Conversely, suppose that (6) holds. 

Case 1. x = +00. By considering +x, we may suppose that x = oo. Thus 
given M ¢€ R there is an N € N such that infysy x, > M. It follows that 
Xn > M for alln > N; that is, x, > oo asn > o. 

Case 2. —co < x < oo. Let e > 0. Choose N € N such that 


€ : € 
supx,—x <= and x-— inf x%< =. 
k>N 2 k>N 2 
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Let n,m > N and suppose for simplicity that x, > x». Then 


. —é & 
IXn —Xm| = Xn — Xm < sup x, —x+x—-— inf x<-+2=6. 
k>N k>N 2 2 


Thus {x,} is Cauchy and converges to some finite real number. But by Theo- 
rem 2.35, some subsequence of {x,} converges to x. We conclude that x, > x 
asn —> oO. a 


Theorem 2.35 also leads to the following geometric interpretation of limits 
supremum and limits infimum. 


2.37 Theorem. Let {x,} be a sequence of real numbers. Then limsup,_, 95 Xn 
(respectively, lim infy—so0 X) is the largest value (respectively, the smallest value) 
to which some subsequence of {xn} converges. Namely, if xn, > x ask + o, 
then 


liminfx, <x <limsup x). (7) 
now n—-oo 


Proof. Suppose that x,, — x ask — oo. Fix N € N and choose K so large 
that k > K implies nz > N. Clearly, 


inf xj < xy, < sup x; 
j2N j=N 


for allk > K. Taking the limit of this inequality as k — oo, we obtain 


inf x; <x < sup x;. 
j2N j=N 


Taking the limit of this last inequality as N — oo and applying Definition 2.32, 
we obtain (7). a 


We close this section with several other properties of limits supremum and 
limits infimum. 


2.38 Remark. /f {x,} is any sequence of real numbers, then 


lim inf x, < lim sup xp. 
noo n—>oo 


Proof. Since infg>n xx < supys, Xx for all n € N, this inequality follows from 
Theorem 2.17 (the Comparison Theorem). i 


The following result is an immediate consequence of Definition 2.32, the 
Comparison Theorem, and the Monotone Convergence Theorem. 


2.39 Remark. A real sequence {x,} is bounded above if and only if 
lim sup,_,59Xn < 00, and is bounded below if and only if lim infn_, 90 X) > —00. 
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The following result shows that we can take limits supremum and limits infi- 
mum of inequalities. 


2.40 Theorem. If x, < y, forn large, then 


limsupx, <limsupy, and liminfx, <liminf y,. (8) 
noo noo ESR racic, 


Proof. If xx < yx fork = N, then supys,xk < Supys, Ye and infesn xe < 
infy>n yx for any n > N. Taking the limit of these inequalities as n — oo, we 


obtain (8). | 
EXERCISES 
2.5.1. Find the limit infimum and the limit supremum of each of the following 
sequences. 


a) X, =14+(-1)" 

b) x, = sin (n7/2) 

©) xm = ((-D" —(1)"*1) /n 
d) x, = (n+ 4)/Vn? — 16 
€) Xn = (2y, + 1)/n, where {y,} is any bounded sequence 
Dm = ((-)" = CI") n 

2) Xp = (n* — 2n? + 5)/(n? + 5n — 1) 


2.5.2. Suppose that {x,} is a real sequence. Prove that 


— lim sup x, = lim inf(—xp) 
n> oo DOS 


and 


— liminf x, = lim sup(—x;,). 
RTO noo 


2.5.3. Let {x,} be a real sequence andr € R. 
a) Prove that 


limsupx, <r implies x, <r 
no 


for n large. 
b) Prove that 


limsupx, >r implies x, >r 
now 


for infinitely many n EN. 
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2.5.4. Suppose that {x,} and {y,} are real sequences. 


a) Prove that 
lim inf x, + lim inf y, < liminf(x, + yn) 
n—>Co n—->Co n—->Co 
< lim sup x, + lim inf y;, 
n> oo 2 OS 


< lim sup(x, + y,) < limsup x, + lim sup yp, 
noo noo noo 


provided that none of these sums is of the form co — oo. 
b) Show that if limy_. 95 xn exists, then 


liminf(x, + y,) = lim x, + liminf y, 
noo n>oo noo 
and 
lim sup(%, + yn) = lim x, + limsup yy. 
noo n> oo n>0oo 


c) Show by examples that each of the inequalities in part (a) can be strict. 


2.5.5. Let {x,} and {y,} be real sequences. 


a) Suppose that x, > 0 and y, > 0 for eachn EN. Prove that 
lim sup(% yn) < (lim sup x,) (lim sup yn), 
n—>oo n—> oo noo 
provided that the product on the right is not of the form 0 - oo. Show 
by example that this inequality can be strict. 


b) Suppose that x, <0 < y, forn €N. Prove that 


(im inf x,)(lim sup y,) < liminf(x; yy), 
n—->Co n—->oo n—->Co 


provided that none of these products is of the form 0 - oo. 


2.5.6. Suppose that x, > 0 and y, > 0 for alln € N. Prove that if x, — x as 


n —> oo (x may be an extended real number), then 
lim sup(%n Yn) = x lim sup yn, 
now noo 


provided that none of these products is of the form 0 - oo. 


2.5.7. Prove that 


n—>0o ne k>n neN \k2n 


lim sup x, = inf (x2) and liminfx, = sup (int) 
N n—>oo 


for any real sequence {x,}. 
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2.5.8. Suppose that x, > 0 for n € N. Under the interpretation 1/0 = oo and 
1/oo = 0, prove that 


1 1 1 
lim sup = —— and liminf — = - : 
noo Xn lim infy_s 60 Xn N>OO Xp lim sup,_566 Xn 


2.5.9. Let x, € R. Prove that x, > 0 asn — oo if and only if 


lim sup |x,| = 0. 
n->Co 
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3.1 TWO-SIDED LIMITS 


In the preceding chapter we studied limits of real sequences. In this chapter we 
examine limits of real functions; that is, functions whose domains and ranges are 
subsets of R. To distinguish such functions from functions whose ranges include 
oo and/or —oo, we shall sometimes refer to real functions as finite valued. 

Recall from elementary calculus that a function f(x) converges to a limit L, 
as x approaches a, if f(x) is near L when x is near a. Here is a precise definition 
of this concept. 


3.1 Definition. 


Let a € R, let J be an open interval which contains a, and let f be a real 
function defined everywhere on J except possibly at a. Then f(x) is said to 
converge to L, as x approaches a, if and only if for every « > 0 there isa dé > 0 
(which in general depends on e, f J, and a) such that 

0 < |x—a|<6 implies |f(x)—-L| <e. (1) 


In this case we write 


L=lim f(x) or f@-L as x—a, 
xa 


and call L the limit of f(x) as x approaches a. 


As was the case for sequences, ¢ represents the maximal error allowed in the 
approximation f(x) to L. In practice, the number 6 represents the tolerance 
allowed in the measurement x of a which will produce an approximation f (x) 
which is acceptably close to the value L. 

According to Definition 3.1, to show that a function has a limit, we must begin 
with a general ¢ > 0 and describe how to choose a 5 which satisfies (1). 


3.2 EXAMPLE. 
Suppose that f(x) = mx + b, where m, b € R. Prove that 


f(a) = lim f(x) 
for alla ER. 
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Proof. If m = 0, there is nothing to prove. Otherwise, given ¢« > 0, set 6 = 
é/|m|. If |x —a| < 6, then 


f(x) — f(a@)| = |mx + b — (ma + b)| = |m| |x — a] < |m|o =e. 
Thus by definition, f(x) > f(a) asx > a. | 


Sometimes, in order to determine 6, we must break f(x) — L into two factors, 
replacing the less important factor by an upper bound. 


3.3 EXAMPLE. 
If f(x) =x? +x- 3, prove that f(x) ~ —lasx —> 1. 


Proof. Let « > 0 and set L = —1. Notice that 
f@)—-L=x*?4+x-2=(¢—-Da +2). 


If 0 <6 < 1, then |x — 1| < 6 implies 0 < x < 2, so by the triangle inequality, 
|x + 2| < |x| +2 < 4. Set 6 = min{1, ¢/4}. It follows that if |x — 1| < 6, then 


| f(x) — L| = |x — 1| |x +2] < 4|x —1| < 46 <e. 
Thus by definition, f(x) > Lasx > 1. | 


Before continuing, we would like to draw your attention to two features of 
Definition 3.1: Assumption 1. The interval / is open; Assumption 2. 0 < |x —al. 
If J = (c,d) is an open interval and 69 := min{a — c,d — a}, then |x —a| < 80 
implies x € J. Hence, Assumption 1 guarantees that for 6 > 0 sufficiently small, 
f(x) is defined for all x 4 a satisfying |x — a| < 6 (i.e., on BOTH sides of a). 
Since |x — a| > 0 is equivalent to x # a, Assumption 2 guarantees that f can 
have a limit at a without being defined at a. (This will be crucial for defining 
derivatives later.) 

The next result shows that even when a function f is defined at a, the value 
of the limit of f at a is, in general, independent of the value f(a). 


3.4 Remark. Leta € R, let I be an open interval which contains a, and let fg be 
real functions defined everywhere on I except possibly at a. If f (x) = g(x) for all 
x € 1 \ {a} and f(x) > Las x — a, then g(x) also has a limit as x — a, and 


lim g(x) = lim f(x). 

xXx—7a xa 
Proof. Let « > 0 and choose 6 > 0 small enough so that (1) holds and |x—a| < 
6 implies x € I. Suppose that 0 < |x —a| < 6. We have f(x) = g(x) by 
hypothesis and | f(x) — L| < e by (1). It follows that |g(x) — L| < «. a 


Thus to prove that a function f has a limit, we may begin by simplifying f 
algebraically, even when that algebra is invalid at finitely many points. 
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3.5 EXAMPLE. 
Prove that 


Py 2 ese 
oe xe = 1 


has a limit as x —> 1. 


Proof. Set f(x) = x-+1 and observe by Example 3.2 that f(x) > 2asx —> 1. 
Since 


PSR Se De = 1 
g(x) = 5 = 5 
x“ — 1 x“ — I 


= Fx) 


for x # +1, it follows from Remark 3.4 that g(x) has a limit as x — 1 (and 
that limit is 2). a 


There is a close connection between limits of functions and limits of 


sequences. 


3.6 Theorem. [SEQUENTIAL CHARACTERIZATION OF LIMITS]. 
Leta €R, let I be an open interval which contains a, and let f be a real function 
defined everywhere on I except possibly at a. Then 


L = lim f(x) 


exists if and only if f (x,) > Las n — oo for every sequence x, € I \ {a} which 
converges to aasn > ©. 


Proof. Suppose that f converges to L as x approaches a. Then given ¢ > 0 
there is a 6 > O such that (1) holds. If x, € J \ {a} converges to a asin —> ov, 
then choose an N € N such that n > N implies |x, — a| < 6. Since x, # a, it 
follows from (1) that | f(x,) — L| < e for alln => N. Therefore, f(x,) > L as 
n> oO. 

Conversely, suppose that f(x,) — L asn — oo for every sequence x, € I \ 
{a} which converges to a. If f does not converge to L as x approaches a, then 
there is an e > 0 (call it ¢9) such that the implication “0 < |x — a| < 6 implies 
| f(x) — L| < €9” does not hold for any 6 > 0. Thus, for each 6 = 1/n, n EN, 
there is a point x, € J which satisfies two conditions: 0 < |x, —a| < 1/n 
and |f(x,) — L| => &9. Now the first condition and the Squeeze Theorem 
(Theorem 2.9) imply that x, 4 a and x, — a so by hypothesis, f(x,) > L, 
as n — oo. In particular, | f(x,) — L| < 0 for n large, which contradicts the 
second condition. a 


Thus to show that the limit of a function f does not exist as x + a, we need 


only find two sequences converging to a whose images under f have different 
limits. 
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3.7 EXAMPLE. 
Prove that 


1 
sin — x £0 

x 
x=0 


f@)= 


has no limit as x — 0. 


AY 


Se ae 
%Y 


FIGURE 3.1 


Proof. By examining the graph of y = f(x) (see Figure 3.1), we are led to 
consider two extremes: 


2 
oF. Bi ete N. 
Gate “ "" Gnaan oe 


Ayn: 


Clearly, both a, and by, converge to 0 as n — oo. On the other hand, since 
f(@n) = land f(b,) = —1 for alln E N, f(a,) ~ 1 and f(b,) > —1 as 
n — oo. Thus by Theorem 3.6, the limit of f(x), as x > 0, cannot exist. | 


Theorem 3.6 also allows us to translate results about limits of sequences 


to results about limits of functions. The next three theorems illustrate this 
principle. 


Before stating these results, we introduce an algebra of functions. Suppose 
that f,g : E — R. For each x €¢€ E, the pointwise sum, f + g, of f and g is 


defined by 


(f + 8)(x) = Ff) + 8), 
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the scalar product, af , of a scalar a € R with f, by 
(af)(x) = af (x), 
the pointwise product, fg, of f and g, by 
(fg)@) := f@)g(x), 


and (when g(x) 4 0) the pointwise quotient, f/g, of f and g, by 


f\ 0 f@) 
(£) er g(x) 


The following result is a function analogue of Theorem 2.12. 


3.8 Theorem. Suppose that a € R, that I is an open interval which contains a, 
and that fg are real functions defined everywhere on I except possibly at a. If f (x) 
and g(x) converge as x approaches a, then so do (f + g)(x), (fg)(x), (af) (x), 
and (f/g)(x) (when the limit of g(x) is nonzero). In fact, 


lim (f + 8) @) = lim f(x) + lim g(x), 
Jim (@f) @) = @ lim f(x), 
Jim (fg) Ge) = lim f(x) lim g(x), 
and (when the limit of g(x) is nonzero) 
lim (“) (x) = mcrae 
& limy +a g(x) 
Proof. Let 
L:= Jim f(x) and M:= Jim g(x). 


If x, € 1\{a} converges to a, then by Theorem 3.6, f(x,) — Land g(x,) > M 
as n — oo. By Theorem 2.121, f(x,) + g@n) > L+M asn — ow. Since this 
holds for any sequence x, € J \ {a} which converges to a, we conclude by 
Theorem 3.6 that 


lim (f +8) (@) =L+M = lim f(x) + lim g(x). 


The other rules follow in an analogous way from Theorem 2.12ii through 
2.12iv. a 


Similarly, the Sequential Characterization of Limits can be combined with 
the Squeeze and Comparison Theorems for sequences to establish the following 
results. 
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3.9 Theorem. [SQUEEZE THEOREM FOR FUNCTIONS]. 
Suppose that a € R, that I is an open interval which contains a, and that f.g,h 
are real functions defined everywhere on I except possibly at a. 


i) If g(x) < h(x) < f(x) forall x € I \ {a}, and 
lim f(x) = lim gv) =L, 
xa x—a 
then the limit of h(x) exists, as x — a, and 
lim h(x) = L. 
xa 
ii) If |g(x)| < M for all x € 1 \ {a} and f (x) > Oas x — a, then 
lim f(x)g(x) = 0. 
xa 
The preceding result is illustrated in Figure 3.2. 


Ay 


#Y 


FIGURE 3.2 


3.10 Theorem. [COMPARISON THEOREM FOR FUNCTIONS]. 

Suppose that a € R, that I is an open interval which contains a, and that fg are 
real functions defined everywhere on I except possibly at a. If f and g have a 
limit as x approaches a and f(x) < g(x) forall x € I \ {a}, then 


lim f(x) < lim g(x). 
xX—a xa 


We shall refer to this last result as taking the limit of an inequality. 
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The limit theorems (Theorems 3.8, 3.9, and 3.10) allow us to prove that limits 
exist without resorting to e’s and 8’s. 


3.11 EXAMPLE. 
Prove that 


x—1 _ 
Zr ee 


lim 
x1 


Proof. By Example 3.2, x — 1 > 0 and 3x +1— 4asx — 1. Hence, by 


Theorem 3.8, (x — 1)/Gx +1) ~ 0/4=0asx > 1. a 
EXERCISES 
3.1.0. Let a € R and let f and g be real functions defined at all points x in 


some open interval containing a except possibly at x = a. Decide which 
of the following statements are true and which are false. Prove the true 
ones and give counterexamples for the false ones. 


a) For eachn €N, the function (x — a)"sin(f (x)(x — a)~") has a limit 
asx >a. 

b) Suppose that {x,} is a sequence converging to a with x, #4 a. If 
f(%n) > Lasn > o, then f(x) > Lasx > a. 

c) If f and g are finite valued on the open interval (a — 1,a + 1) and 
f(x) > Oasx — a, then f(x)g(x) > Oasx > a. 

d) If lim,+, f(x) does not exist and f(x) < g(x) for all x in some open 
interval J containing a, then lim,_,, g(x) doesn’t exist either. 


3.1.1. Using Definition 3.1, prove that each of the following limits exist. 
a) lim x* +5x+6= 12 
xz 
2_ 5, — 
By) in x* —5x-—6 i: 
x21 x+1 
c) lim x*—2x7+1=0 
x1 
d) lim x cos(zx) = 0 
x0 
3.1.2. Decide which of the following limits exist and which do not. Prove that 


your answer is correct. (You can use well-known facts about the values 
of tanx, sinx, and log x, e.g., that log x > —oo asx — 0°.) 


a) lim tan (=) 


x0 


3.1.3. 


3.1.4. 
3.1.5. 
3.1.6. 


3.1.7 |. 
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x0 x4 


244 
b) lim x? sin (- a ) 


i 
e) pee log x3 


Evaluate the following limits using results from this section. (You may 
assume that sinx, 1 —cosx, tanx, and \/x converge to 0 as x > 0.) 


x7 +6x +5 


Stan(x — 2) 
c) lim ———— 


x2 cos(2x — 4) 


ay dim 2sinx — sin2x 
x30 1 —cos2(x) 


1 
e) lim cos (* . ) log(x + 1) 
Xx 


x20 


Prove Theorem 3.9. 
Prove Theorem 3.10. 
Suppose that f is a real function. 


a) Prove that if 
L = lim f(x) 
xXx7a 


exists, then | f(x)| > |L| asx > a. 
b) Show that there is a function such that, as x > a, |f(x)| — |L| but 
the limit of f(x) does not exist. 


This exercise is used in Sections 3.2 and 5.2. For each real function f, 
define the positive part of f by 


IF) + FO) 


i @= 5 x € Dom (f) 
and the negative part of f by 
fgets x € Dom (f). 


2 
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a) Prove that ft(x) > 0, f-() = 0, f(x) = ft(x) — f(x), and 
| f(x)| = ft() + f7 (x) all hold for every x € Dom(f). (Compare 
with Exercise 1.2.3.) 

b) Prove that if 


L = lim f(x) 
xa 
exists, then f*(x) > L* and f~(x) > L7 asx > a. 


This exercise is used in Sections 3.2 and 5.2. Let f, g be real functions 
and for each x € Dom (f) N Dom (g) define 


(f Vv g)(x) = max{f(x), g(@x)} and (f A g)(x) = min{ f(x), g(x)}. 
a) Prove that 


(fF +8)@) +17Ff — g)Q)I 


(f Vv 3) = 5 


and 


(f + 8)@) —I(f — 8)Q)| 


(f Ag) = 5 


for all x € Dom (f) N Dom (g). 
b) Prove that if 


L= lim f(x) and M = lim g(x) 
xa xa 


exist, then (f V g)(x) > LV Mand (fA g)(x) > LAMasx — a. 


Suppose that a € R and / is an open interval which contains a. If f : 
I > R satisfies f(x) > f(a), as x — a, and if there exist numbers M 
and m such that m < f(a) < M, prove that there exist positive numbers 
é and 6 such that 


m+eée< f(x)<M-e 


for all x’s which satisfy |x —a| < 6. 


3.2 ONE-SIDED LIMITS AND LIMITS AT INFINITY 


In the preceding section we defined the limit of a real function. In this section 
we expand that definition to handle more general situations. 

What is the limit of f(x) := /x —1 asx — 1? A reasonable answer is that 
the limit is zero. This function, however, does not satisfy Definition 3.1 because 
it is not defined on an OPEN interval containing a = 1. Indeed, f is only defined 
for x > 1. To handle such situations, we introduce “one-sided” limits. 
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3.12 Definition. 
Let a € Rand f be a real function. 


i) f(x) is said to converge to L as x approaches a from the right if and only if f 
is defined on some open interval J with left endpoint a and for every ¢ > 0 
there is a5 > 0 (which in general depends one, f, 7, and a) such that 


a+édeI and a<x<a+é imply |f(x)-L|<e. (2) 
In this case we call L the right-hand limit of f at a, and denote it by 


flat) :=L=: im, £@). 


ii) f(x) is said to converge to L as x approaches a from the left if and only if 
f is defined on some open interval 7 with right endpoint a and for every 
€ > 0 there is a 5 > 0 (which in general depends on ¢, f, 7, and a) such 
thata —5 € Janda—6 <x <aimply|f(x) —L| < e. In this case we call 
L the left-hand limit of f at a and denote it by 


f(a-):=L=: im f(x). 


It is easy to check that when two-sided limits are replaced with one-sided 
limits, all the limit theorems from the preceding section hold. We shall use them 
as the need arises without further comment. 

Existence of a one-sided limit can be established by these limit theorems or 
by appealing directly to the definition. 


3.13 EXAMPLES. 
i) Prove that 


(x) . 
FO) = x-1 x <0 

has one-sided limits at a = 0 but lim,_,9 f(x) does not exist. 
ii) Prove that 


lim jx =0. 


x—>0+ 


Proof. i) Let ¢ > Oand set 6 =e. If0 < x < 6, then | f(x) — 1] = |x| <d =<. 
Hence lim,.04 f(x) exists and equals 1. Similarly, lim,_.9_ f(x) exists and 
equals —1. However, x, = (—1)"/n > 0 but f(x) = (—1L)"(1 + 1/n) does 
not converge as n — oo. Hence by the Sequential Characterization of Limits, 
lim,_s9 f(x) does not exist. 

ii) Let e > O and set 5 = €”. If0 < x <6, then|f(x)|= /x< /é=ec. J 
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Not every function has one-sided limits (see Example 3.7). Examples 3.13 
show that even when a function has one-sided limits, it may not have a two- 
sided limit. The following result, however, shows that if both one-sided limits, 
at a point a, exist and are EQUAL, then the two-sided limit at a exists. 


3.14 Theorem. Let f be areal function. Then the limit 


lim f(x) 
xa 
exists and equals L if and only if 
L= lim f(x) = lim f(x). (3) 
X>at+ x>a— 


Proof. If the limit L of f(x) exists as x > a, then given e > 0 choose 6 > 0 
such that 0 < |x —a| < 6 implies | f(x) — L| < e. Since any x which satisfies 
a<x<a+éora—6 <x <a also satisfies 0 < |x —a| < 4, it is clear that 
both the left and right limits of f(x) exist as x — a and satisfy (3). 
Conversely, suppose that (3) holds. Then given ¢ > 0 there exists a 6) > 0 
(respectively, a 52 > 0) such that a < x < a+6, (respectively, a — 62 < x <a) 
implies | f(x) — L| < e. Set 6 = min{d;, 62}. Then 0 < |x —a| < 6 implies 
a<x <a+6, ora —6) < x < a (depending on whether x is to the right or 
to the left of a). Hence (1) holds; that is, f(x) > Lasx —> a. a 


The definition of limits of real functions can be expanded to include extended 
real numbers. 


3.15 Definition. 
Let a, L € Rand let f be a real function. 


i) f(x) is said to converge to L as x — oo if and only if there exists ac > 0 
such that (c, co) C Dom(f) and given e > 0 there is an M ¢€ R such that 
x > M implies | f(x) — L| < ¢, in which case we shall write 


lim fx) =L or f(x) > Lasx > oo. 
x>CO 


Similarly, f(x) is said to converge to L as x — —oo if and only if there exists 
ac > Osuch that (—oo, —c) C Dom(f) and given e > 0 there isan MER 
such that x < M implies | f(x) — L| < ¢, in which case we shall write 


lim f(x)=L or f(x) > Lasx > oo. 
x7 0CO 


il 


wa 


The function f(x) is said to converge to co as x — a if and only if there is 
an open interval J containing a such that J\ {a} C Dom(f) and given M ER 
there is a 6 > O such that 0 < |x —a| < 6 implies f(x) > M, in which case 
we shall write 


lim f(x) = oc or f(x) > wasx— a. 
xa 
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Similarly, f(x) is said to converge to —oo as x — aif and only if there is an 
open interval J containing a such that J \ {a} C Dom(f) and given M ER 
there isa 6 > 0 such that 0 < |x —a| < 6 implies f(x) < M, in which case 
we shall write 


lim f(x) = —co or f(x) > -ooasx > a. 
xa 


Obvious modifications of Definition 3.15, which we leave to the reader, can 
be made to define f(x) > +oo asx > a+andx — a-, and f(x) > +0oo as 
x — too. 


3.16 EXAMPLES. 


i) Prove that 1/x + 0 asx — oo. 
ii) Prove that 


2 
lim f(x) = lim ao 00. 
x>1- 


x>1- 2x? —3x+1 fe 


Proof. i) Given ¢ > 0, set M = I/e. Ifx > M, then |1/x| = 1/x < 1/M =e. 
Thus 1/x > 0 as x — oo. 

ii) Let M € R. We must show that f(x) < M for x near but to the left of 1 
(no matter how large and negative M is). Without loss of generality, assume 
that M < 0. As x converges to 1 from the left, 2x” — 3x + 1 is negative and 
converges to 0. (Observe that 2x* — 3x +1 is a parabola opening upward with 
roots 1/2 and 1.) Therefore, choose 6 € (0, 1) such that 1 — 6 < x < 1 implies 
2/M < 2x* —3x +1 < 0; that is, —1/(2x7 — 3x + 1) > —M/2 > 0. Since 0 < 
x < lalsoimplies 2 < x+2 < 3, it follows that —(x+2)/(2x*-—3x+1) > —M; 
that is, 


x+2 
= —~—_——_ < M 
FO) 2x2 —3x+1 


foralll-—d <x <1. |_| 


In order to unify the presentation of one-sided, two-sided, and infinite limits, 
we introduce the following notation. Let a be an extended real number, and let 
I be anondegenerate open interval which either contains a or has a as one of its 
endpoints. Suppose further that f is a real function defined on J except possibly 
at a. If a is finite and J contains a, then 


Jim f(x) (4) 


xel 


will denote lim,., f(x) (when it exists); if a is a finite left endpoint of J, then 
(4) will denote lim,-.+ f(x) (when it exists); if a is a finite right endpoint of /, 
then (4) will denote lim,—.,— f(x) (when it exists); if a = oo is an endpoint of 
T, then (4) will denote limy_,+.5 f(x) (when each exists). 

Using this notation, we can state a Sequential Characterization of Limits valid 
for two-sided, one-sided, and infinite limits. 
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3.17 Theorem. Let a be an extended real number, and let I be a nondegenerate 
open interval which either contains a or has a as one of its endpoints. Suppose 
further that f is a real function defined on I except possibly at a. Then 


lim f(x) 
ee 


exists and equals L if and only if f (x,) — L for all sequences x, € I which satisfy 
XxX, #aand x, > aasn —> oO. 


Proof. Since we have already proved this for two-sided limits, we must show 
it for the remaining eight cases which notation (4) represents. Since the proofs 
are similar, we shall give the details for only one of these cases, namely the 
case when a belongs to J and L = oo. Thus we must prove that f(x) — oo as 
x — aif and only if f(x,) — oo for any sequence x, € I which converges to 
a and satisfies x, 4 a forn —N. 

Suppose first that f(x) > coasx > a. If x, € I, % > aasn > ~w, 
and x, 4 a, then given M ¢€ R there isa é > 0 such that 0 < |x —a| < 6 
implies f(x) > M, and there is an N € Nsuch that n > N implies |x, —a| < 6. 
Consequently, n > N implies f(x,) > M; that is, f(%,) > oo asn > oo as 
required. 

Conversely, suppose to the contrary that f(x,) — oo for any sequence 
Xn € I which converges to a and satisfies x, 4 a but f(x) does NOT converge 
to co as x — a. By the definition of “convergence” to oo there are numbers 
Mo € Rand x, € J such that |x, — a| < 1/n and f(x) < Mo for alln EN. 
The first condition implies x,, > a but the second condition implies that f(x,) 
does not converge to co as n — ov. This contradiction proves Theorem 3.17 
in the case a € J and L = oo. a 


Using Theorem 3.17, we can prove limit theorems that are function analogues 
of Theorem 2.15 and Corollary 2.16. We leave this to the reader and will use 
these results as the need arises. 

These limit theorems can be used to evaluate infinite limits and limits at too. 


3.18 EXAMPLE. 
Prove that 


Proof. Since the limit of a product is the product of the limits, we have by 
Example 3.161 that 1/x” — 0 as x > oo for any m € N. Multiplying numera- 
tor and denominator of the expression above by 1/x”, we obtain 


Pen | 2. yi limgsagOe 1/07) i) 
= 1m — = 
x00 J— x2 x00 —1 4 1/x2 limy_,.(—1 4+ 1/x2) — -1 


=-—2. 
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EXERCISES 


3.2.0. 


3.2.1. 


3.2.2. 


Decide which of the following statements are true and which are false. 
Prove the true ones and provide counterexamples for the false ones. 


a) If f(x) > cw asx > ow and g(x) > 0, then g(x)/f(x) > 0 as 
xX > OO. 

b) If f(x) > Oasx > a+and g(x) = 1 forallx € R, then g(x)/f(x) > 
oo as x > a+. 

c) If f(x) > co asx —> oo, then sin(x? + x + 1)/f(x) > Oas x > oo. 

d) If P and Q are polynomials such that the degree of P is less than or 
equal to the degree of Q (see Exercise 3.2.3), then there isan L ER 
such that 


P(x) _ P(x) 
im ——= lim ——= 
x00 O(x) x+-00 O(x) 


For each of the following, use definitions (rather than limit theorems) 
to prove that the limit exists. Identify the limit in each case. 


ag 
a) lim — 


x>1- Y(x — 1)4 


COS X 


b) lim 


xX->0O X 


ii 

c) lim ————— 

xo-1+ x2 —5x —6 
ay 

a) lis — 

xolt+ 1l+x — 2x2 


; sin(e°S*) 
e) lim ———— 
x>-00 x4+2 


Assuming that e* — e“, sinx — sina, and cosx — cosa as x — a for 
any a € R, evaluate the following limits when they exist. 


Si Dy es Oye 
a) tin 


x>37 x2 — 9 


3x* 4x7 =—5 
b) lim —W—— 
x00 2x4 + 2x3 — 1 


a Sx+1 
c) lim e° 
X—>—OO 
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ex +5x+6 
d) lim ——— 
x>0- cosx — | 
e) lim pose) 
x>0 VW sin? x 
. V1 — sin2x 
f) lim ———_— 
x>1/4 cos 2x 
3.2.3 |. This exercise is used many places. Recall that a polynomial of degree n 
is a function of the form 
P(x) = nx" + an—ix""! +--+ a1x +0, 
where aj € Rfor j =0,1,...,n anda, £0. 
a) Prove that lim,., x” =a” forn =0,1,--- anda eR. 
b) Prove that if P is a polynomial, then 
lim P(x) = P(a) 
xa 
for everya ER. 
3.2.4. Prove the following comparison theorems for real functions f and g, 
anda eR. 
a) If f(x) = g(x) and g(x) > wasx > a, then f(x) > coasx >a. 
b) If f(x) < g(x) < A(x) and 
L:= lim f(x) = lim h(x), 
X00 X—>O0O 
then g(x) ~ Lasx > ~. 
3.2.5. Prove the following special case of Theorem 3.17: Suppose that a € R 
and f : [a,co) ~ R forsomea € R. Then f(x) — Las x — oo if and 
only if f(x,) — L for any sequence x, € (a, oo) which converges to oo 
asn —> oOo, 
3.2.6. Suppose that f : [0,1] ~ Rand f(a) = limy.q f(x) for all a € [0, 1]. 
Prove that f(g) = 0 for all g € QN (0, 1) if and only if f(x) = 0 for all 
x € [0, 1]. 
3.2.7. Suppose that P is a polynomial and that P(a) > 0 for a fixeda € R. 


Prove that P(x)/(x —a) > ooasx > a+, P(x)/(x — a) > —oo as 
x — a—, but 


does not exist. 
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3.2.8. [CAucHy] Suppose that f : N > R. If 
lim fr+1)— ff) =L, 
nwo 
prove that limp... f(n)/n exists and equals L. 


3.3. CONTINUITY 


In elementary calculus, a function is called continuous at a ifa € Dom f and 
f(x) > f(@ asx — a. In particular, it is tacitly assumed that f is defined 
on BOTH sides of a. Here, we introduce a more general concept of continuity 
which includes functions, such as ,/x at a = 0, which are defined on only one 
side of some point in their domain. 


3.19 Definition. 
Let E be nonempty subset of Rand f: E — R. 


i) f is said to be continuous ata pointa € E if and only if given e > 0 there is 
a 6 > 0 (which in general depends on «, f, and a) such that 


Ix—al|<6 and xe€E imply |f@)—-f@|<e. (5) 


ii) f is said to be continuous on E (notation: f : E > Ris continuous) if and 
only if f is continuous at every x € E. 


The following result shows that if E is an open interval which contains a, then 
“f is continuous at a € E” means “ f(x) > f(a) asx — a.” Therefore, we shall 
abbreviate “f is continuous at a € E” by “f is continuous at a” when E is an 
open interval. 


3.20 Remark. Let I be an open interval which contains a pointaand f :I > R. 
Then f is continuous ata € I if and only if 


f(a) = lim f(x). 


Proof. Suppose that J = (c,d) and set 59 := min{|c — al, |d — al}. If 8 < 8p, 
then |x — a| < 6 implies x € J. Therefore, condition (5) is identical to (1) 
when f(a) = L, E =I, and6 < 6p. It follows that f is continuous ata ¢€ / if 
and only if f(x) > f(a) asx > a. | 


By repeating the proof of Theorem 3.6, we can establish a sequential charac- 
terization of continuity which is valid on any nonempty set. 


3.21 Theorem. Suppose that E is a nonempty subset of R, that a € E, and that 
f: ER. Then the following statements are equivalent: 


i) fis continuous ata € E. 
ii) If x, converges to aand x, € E, then f (xn) > f(a) asn— oo. 
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In particular, ./x is continuous on J = [0, oo) by Exercise 2.2.5. 
By combining Theorem 3.21 with Theorem 2.12, we obtain the following result. 


3.22 Theorem. Let E be a nonempty subset of Rand f,g:E —> R. If fg are 
continuous at a point a € E (respectively, continuous on the set E), then so are 
f +g, fg, and af (for any a € R). Moreover, f/g is continuous ata € E when 
g(a) £0 (respectively, on E when g(x) #0 forall x € E). 


It follows from Exercises 3.1.6, 3.1.7, and 3.1.8 that if fg are continuous at a 
point a € E or onaset E, then so are |f|, ft, f~, f Vg, and f A g. We also 
notice by Exercise 3.2.3 that every polynomial is continuous on R. 

Many complicated functions can be broken into simpler pieces, using sums, 
products, quotients, and the following operation. 


3.23 Definition. 


Suppose that A and B are subsets of R, that f : A > Randg: B-R. If 
f(A) C B for every x € A, then the composition of g with f is the function 
gof:A— R defined by 


(go f)(x) = g(f(x)), eA: 


The following result contains information about when a limit sign and some- 
thing else (in this case, the evaluation of a function) can be interchanged. We 
shall return to this theme many times, identifying conditions under which we 
can interchange any two of the following objects: limits, integrals, derivatives, 
infinite summations, and computation of a function (see especially Sections 7.1, 
7.2, and 11.1, and the entry “interchange the order of” in the Index). 


3.24 Theorem. Suppose that A and B are subsets of R, that f : A > Rand 
g:B—R, and that f(x) € B foreveryx € A. 


i) If A :=1 \ {a}, where I is a nondegenerate interval which either contains a or 
has a as one of its endpoints, if 


L:= lim f(x) 


xel 
exists and belongs to B, and if g is continuous at L € B, then 
lim (go f)(x) = 8 (1,700) . 
eae eee 


ii) If f is continuous ata € A and g is continuous at f(a) € B, then go f is 
continuous ata € A. 
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Proof. Suppose that x, € J \ {a} and that x. -~ aasn — oo. Since 
f(A) © B, fm) € B. Also, by the Sequential Characterization of Limits 
(Theorem 3.17), f(%,) > Lasn — oo. Since g is continuous at L ¢€ B, it 
follows from Theorem 3.21 that go f(x) := g(f(%)) > g(L) asn > ~. 
Hence by Theorem 3.17, g 0 f(x) ~ g(L) asx > ain J. This proves i). A 
similar proof establishes part ii). | 


For many applications, it is important to be able to find the maximum or min- 
imum of a given function. As a first step in this direction, we introduce the 
following concept. 


3.25 Definition. 


Let E be a nonempty subset of R. A function f : E > Ris said to be bounded 
on E if and only if there is an M € R such that | f(x)| < M for all x € EZ, in 
which case we shall say that f is dominated by M on E. 


Notice that whether a function f is bounded or not on a set E depends on 
E as well as on f. For example, f(x) = 1/x is dominated by 1 on [1, oo) but 
unbounded on (0,2). Again, the function f(x) = x” is dominated by 4 on (—2, 2) 
but unbounded on [0, oo). 

The following result, which will be used often, shows that a continuous func- 
tion on a closed, bounded interval is always bounded. 


3.26 Theorem. [EXTREME VALUE THEOREM]. 
If Lis a closed, bounded interval and f : I —> Ris continuous on I, then f is 
bounded on I. Moreover, if 


M=sup f(x) and m= inf f(x), 


xel 


then there exist points xm,xy € I such that 
fxm) =M and fn) =m. (6) 


Proof. Suppose first that f is not bounded on J. Then there exist x, € J 
such that 


lf (%n)| > 1, neN. (7) 


Since J is bounded, we know (by the Bolzano—Weierstrass Theorem) that {x} 
has a convergent subsequence, say x,, — a ask — oo. Since J is closed, 
we also know (by the Comparison Theorem) that a € J. In particular, 
f(a) € R. On the other hand, substituting n; for n in (7) and taking the limit 
of this inequality as k — oo, we have | f(a)| = oo, a contradiction. Hence, the 
function f is bounded on /. 

We have proved that both M and m are finite real numbers. To show that 
there is an xy € J such that f(xy) = M, suppose to the contrary that f(x) < 
M for all x € J. Then the function 
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1 


(x) = ——__ 
ee MFR) 

is continuous, hence bounded on /. In particular, there is a C > 0 such that 
|g(x)| = g(x) < C. It follows that 


f@) <M ~~ (8) 


for all x € J. Taking the supremum of (8) over all x € J, we obtain M < M — 
1/C < M, acontradiction. Hence, there is an xy € J such that f(xy) = M. 
A similar argument proves that there is an x, € J such that f(vm) =m. 


We shall sometimes refer to (6) by saying that the supremum and infimum 
of f are attained on I. We shall also call the value M (respectively, m) the 
maximum (respectively, the minimum) of f on I. 

Neither of the hypotheses on the interval J in Theorem 3.26 can be relaxed. 


3.27 Remark. The Extreme Value Theorem is false if either “closed” or 
“bounded” is dropped from the hypotheses. 


Proof. The interval (0,1) is bounded but not closed, and the function f(x) = 
1/x is continuous and unbounded on (0,1). The interval [0, co) is closed 
but not bounded, and the function f(x) = x is continuous and unbounded 
on [0, co). | 


What more can be said about continuous functions? One useful conceptual- 
ization of functions which are continuous on an interval is that their graphs have 
no holes or jumps (see Theorem 3.29 below). Our proof of this fact is based on 
the following elementary observation. 


3.28 Lemma. 

Suppose that a < band that f : [a,b) > R. If f is continuous at a point 
xo € [a,b) and f(xo) > 0, then there exist a positive number ¢ and a point 
x, € [a, b) such that x, > xq and f(x) > € forall x € [xo, x1]. 


STRATEGY: The idea behind the proof is simple. If f(x) > 0, then f(x) > 
f (xo)/2 for x near xo. Here are the details. 


Proof. Let « = f(xo)/2. Since xq < D, it is easy to see that 59 := (b — xo) /2 
is positive and that a < x < x9 + 69 implies x € [a, b). Use Definition 3.19 to 
choose 0 < 6 < dg such that x € [a, b) and |x—xo| < Sd imply | f(x)—f(xo)| < ¢. 

Fix x; € (x9, x9 + 6) and suppose that x € [xo, x1]. By the choice of ¢ and 6, 
it is clear that 


_ fo) Ff (x0) 
2 2 


<f') — Fo) < 


Solving the left-hand inequality for f(x), we conclude that f(x) > f(xo)/2 = 
€, aS promised. a 
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A real number yo is said to lie between two numbers c and d if and only if 
c<yo <dord < yo <c. 


3.29 Theorem. [INTERMEDIATE VALUE THEOREM]. 
Suppose that a < band that f : [a,b] > Ris continuous. If yo lies between 
f(@ and f(b), then there is an xq € (a, b) such that f (xo) = yo. 


Proof. We may suppose that f(a) < yo < f(b). Consider the set EF = 
{x € [a,b] : f(x) < yo} (see Figure 3.3). Since a € E and E C [a,b], E 
is a nonempty, bounded subset of R. Hence, by the Completeness Axiom, 
xo := sup E£ is a finite real number. It remains to prove that xo € (a,b) and 
f (x0) = yo- 


Yo 


fa) - 


a Xy Db 


FIGURE 3.3 


Choose by Theorem 2.11 a sequence x, € E such that x, > xp asin —> oo. 
Since E C [a, b], it follows from Theorem 2.17 that x9 € [a,b]. Moreover, 
by the continuity of f and the definition of EF, we have f(xo) = limp—+oo 
f(%n) < yo. 

To show that f(xo) = yo, suppose to the contrary that f(x0o) < yo. Then 
yo — f (x) is a continuous function on the interval [a, b) whose value at x = xo 
is positive. Hence, by Lemma 3.28, we can choose an ¢ and an x; > xo such 
that yo — f(x) > e > O. In particular, x; € E and x; > sup E, a contradiction. 

We have shown that xo € [a,b] and yo = f(xo). In view of our opening 
assumption, f(a) < yo < f(b), it follows that x9 cannot equal a or b. We 
conclude that xo € (a, b). a 


Thus, if f is continuous on [a,b] and f(a) < yo < f(b), then there is an 
xo € [a, b] such that f(xo0) = yo. 

If f fails to be continuous at a point a, we say that f is discontinuous at a 
and call a a point of discontinuity of f. How badly can a function behave near 
a point of discontinuity? The following examples can be interpreted as answers 
to this question. (See also Exercise 9.8.9.) 
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3.30 EXAMPLE. 
Prove that the function 


|x| 


FO Se as 
is continuous on (—oo, 0) and [0, oo), discontinuous at 0, and that both f(0+) 
and f(0—) exist. 


Proof. Since f(x) = 1 for x > 0, it is clear that f(0+) = 1 exists and 
f(@) — f@ asx — a for anya > 0. In particular, f is continuous on 


[0, co). Similarly, f(O—) = —1 and f is continuous on (—oo, 0). Finally, since 
fO+) 4 f(O-), the limit of f(x) as x — 0 does not exist by Theorem 3.14. 
Therefore, f is not continuous at 0. a 


3.31 EXAMPLE. 
Assuming that sin x is continuous on R, prove that the function 
ae | 
sin — x40 
x 
1 x=0 


f(x)= 


is continuous on (—oo, 0) and (0, oo), discontinuous at 0, and neither f(0+) nor 
f (O—) exists. (See Figure 3.1.) 


Proof. The function 1/x is continuous for x 4 0 by Theorem 3.8. Hence, 
by Theorem 3.24, f(x) = sin(1/x) is continuous on (—oo, 0) and (0, 00). To 
prove that f(0+) does not exist, let x, = 2/((2n + 1)z), and observe (see 
Appendix B) that sin(1/x,) = (-1)", n EN. Since x, | 0 but (—1)” does not 
converge, it follows from Theorem 3.21 (the Sequential Characterization of 
Continuity) that f(0+) does not exist. A similar argument proves that f(0—) 
does not exist. a 


3.32 EXAMPLE. 
The Dirichlet function is defined on R by 


1 x€EQ 


PROKS VG, - so, 


Prove that every point x € R is a point of discontinuity of f. (Such functions 
are called nowhere continuous.) 


Proof. By Theorem 1.18 and Exercise 1.3.3 (Density of Rationals and Irra- 
tionals), given any a € Rand 6 > 0 we can choose x; € Q and x2 € R\ Q such 
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that |x; — a| < 6 fori = 1,2. Since f(x1) = 1 and f(x2) = 0, f cannot be 
continuous at a. B 


3.33 EXAMPLE. 


Prove that the function 


f@M= . x= €Q _ (inreduced form) 
0 x€Q 


is continuous at every irrational in the interval (0,1) but discontinuous at every 
rational in (0,1). 


Proof. Let a be a rational in (0,1) and suppose that f is continuous at a. If 
Xn 1s a Sequence of irrationals which converges to a, then f(x,) > f(a); that 
is, f(a) = 0. But f(a) 4 0 by definition. Hence, f is discontinuous at every 
rational in (0,1). 

Let a be an irrational in (0,1). We must show that f(x,) > f(q@) for every 
sequence x, € (0, 1) which satisfies x, — a asn — oo. We may suppose that 
X, € Q. For eachn €N, write x, = py/gn in reduced form. Since f(a) = 0, 
it suffices to show that g, — oo asn — oo. Suppose to the contrary that 
there exist integers n} < nz < ... such that |qn,| < M < oo fork €N. Since 
Xn, € (0, 1), it follows that the set 


B= |i =P ken! 


contains only a finite number of points. Hence, the limit of any sequence in E 
must belong to £, a contradiction since a is such a limit and isirrational. MH 


To see how counterintuitive Example 3.33 is, try to draw a graph of y = f(x). 
Stranger things can happen. 


3.34 Remark. The composition of two functions g o f can be nowhere continu- 
ous, even though f is discontinuous only on Q and g is discontinuous at only one 


point. 


Proof. Let f be the function given in Example 3.33 and set 


1 
Ko=| ies 


0 x=0. 
Clearly, 
1 
(go f(x) = ne 


Hence, go f is the Dirichlet function, nowhere continuous by Example 3.32. 
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In view of Example 3.33 and Remark 3.34, we must be skeptical of proofs 
which rely exclusively on geometric intuition. And although we shall use ge- 
ometric intuition to suggest methods of proof for many results in subsequent 
chapters, these suggestions will always be followed by a careful rigorous proof 
which contains no fuzzy reasoning based on pictures or sketches no matter how 
plausible they seem. 


EXERCISES 
For these exercises, assume that sinx, cos x, and e* are continuous on R. 


3.3.0. Decide which of the following statements are true and which are false. 
Prove the true ones and provide counterexamples for the false ones. 


a) If f is continuous on [a,b] and J := f({a,b]), then J is a closed, 
bounded interval. 

b) If f and g are continuous on [a,b], if f(a) < g(a) and f(b) > g(b), 
then there is ac € [a, b] such that f(c) = g(c). 

c) Suppose that f and g are defined and finite valued on an open inter- 
val I which contains a, that f is continuous at a, and that f(a) 4 0. 
Then g is continuous at a if and only if fg is continuous at a. 

d) Suppose that f and g are defined and finite valued on R. If f and 
go f are continuous on R, then g is continuous on R. 


3.3.1. Use limit theorems to show that the following functions are continuous 


on [0,1]. 
= Je* cos(x? + 2x + 6) 
a) EO ee 10) 
3x2 —5x —2 
DB) f= x—2 a 
4 x=1 


3x x=0 
sin x 0 

d) r= ["e a 
x=0 


3.3.2. For each of the following, prove that there is at least one x € R which 
satisfies the given equation. 


ayes Sx? 
b) logx = cosx 


Cy a? oF 2 


3.3.3. 


3.3.4. 


3.3.5. 


3.3.6. 


3.3.7. 


3.3.8. 


3.3.9 |. 


3.3.10. 


3.3.11. 
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If f : [a,b] > Ris continuous, prove that sup, ¢y,4) | f()| is finite. 

If f : [a, b] > [a, b] is continuous, then f has a fixed point; that is, there 
is ac € [a, b] such that f(c) =c. 

If f is a real function which is continuous at a € R and if f(a) < M for 
some M e R, prove that there is an open interval J containing a such 
that f(x) < M forall x € J. 

Show that there exist nowhere continuous functions f and g whose sum 
f +g is continuous on R. Show that the same is true for the product of 
functions. 

Suppose that a € R, that J is an open interval containing a, that f, g : 
I — R, and that f is continuous at a. Prove that g is continuous at a if 
and only if f + g is continuous at a. 

Suppose that f : R — R satisfies f(x + y) = f(x) + fG) for each 
x,yeER. 


a) Show that f(nx) = nf (x) for allx € Randn € Z. 

b) Prove that f(qx) = qf (x) for allx e Randg €Q. 

c) Prove that f is continuous at 0 if and only if f is continuous on R. 

d) Prove that if f is continuous at 0, then there is an m € R such that 
f(x) = mx for allx ER. 


This exercise is used in Section 7.4. Suppose that f : R > (0,0) 
satisfies f(x + y) = f(x) f(y). Modifying the outline in Exercise 3.3.8, 
show that if f is continuous at 0, then there is ana € (0, oo) such that 
f() = a* for all x € R. (You may assume that the function a* is 
continuous on R.) 

If f :R — Ris continuous and 


lim f(x)= lm f(x) =0oo, 
x70 x7 —-CO 
prove that f has a minimum on R; that is, there is an x,, € R such that 
f(&m) = inf f(x) < ~w. 
xeER 


Let a > 1. Assume that a?t? = a?a‘ and (a?)4 = a?4 for all p,q € Q, 
and that a? < a? for all p,q € Q which satisfy p < gq. (This is easy, 
but tedious, to prove using algebra, induction, and the definitions a? = 
1, a” =1/a", and a/” — X/q™ forn € Nandm € Z. The hard part is 
proving that ~/a” exists, and this requires the Completeness Axiom— 
see Appendix A.10.) 

For each x € R, define 


A(x) := sup{a? :q € Qandg < x}. 
a) Prove that A(x) exists and is finite for all x € R, and that A(p) = a? 


for all p € Q. Thus a* := A(x) extends the “power of a” function 
from Q to R. 
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b) Ifx, y €e Rwith x < y, prove that a* <a’. 

c) Use Example 2.21 to prove that the function a* is continuous on R. 

d) Prove that a*tY = a*a’, (a*)? = a*”, anda™* = 1/a* for all x, 
yeR. 

e) For 0 < b < 1, define b* = (1/b)~*. Prove that c) and d) hold for 
b in place of a. State and prove an analogue of b) for b* and b” in 
place of a* anda’. 


3.4 UNIFORM CONTINUITY 


The following concept is very important and will be used many times in the rest 
of the book. 


3.35 Definition. 


Let E be a nonempty subset of R and f : E — R. Then ff is said to be 
uniformly continuous on E (notation: f : E > Ris uniformly continuous) if 
and only if for every ¢ > 0 there is ad > O such that 


|x—a|<6 and x,aeE imply |f(x)-—f@|<e. (9) 


Notice that the 6 in Definition 3.35 depends on « and f, but not on a 
and x. This issue needs to be addressed when we prove that a given func- 
tion is uniformly continuous on a specific set (e.g., by determining 5 before a 
is mentioned). 


3.36 EXAMPLE. 
Prove that f(x) = x? is uniformly continuous on the interval (0,1). 
Proof. Given « > 0, set 6 = ¢/2. If x, a € (0, 1), then |x + a| < |x| + |a| < 2. 
Therefore, if x, a € (0, 1) and |x — a| < 6, then 
lf (x) — f(@)| = |x? —a?| = |x —al |x +a| < 2|x —al < 25 =e. a 


The definitions of continuity and uniform continuity are very similar. In 
fact, the only difference is that for a continuous function, the parameter 6 may 
depend on a, whereas for a uniformly continuous function, 6 must be chosen 
independently of a. In particular, every function uniformly continuous on E is 
also continuous on E. The following example shows that the converse of this 
statement is false unless some restriction is made on E. 


3.37 EXAMPLE. 


Show that f(x) = x? is not uniformly continuous on R. 


Proof. Suppose to the contrary that f is uniformly continuous on R. Then 
there is a 6 > Osuch that |x —a| < 6 implies | f(x) — f(a)| < | forallx,a eR. 
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By the Archimedean Principle, choose n € N so large that nd > 1. Seta =n 
and x =n+6/2. Then |x — a| < 6 and 


$2 
1>If@) - f@/ =| -a*|=nd+ 7 > nd > 1 


This contradiction proves that f is not uniformly continuous on R. a 


Here is a key which unlocks the difference between continuity and uniform 
continuity. 


3.38 Lemma. 
Suppose that E C Rand that f : E — Ris uniformly continuous. If x, € E is 
Cauchy, then f (x) is Cauchy. 


Proof. Let « > 0 and choose 5 > 0 such that (9) holds. Since {x,} is Cauchy, 
choose N € N such that n,m > N implies |x, — x| < 6. Thenn,m > N 
implies | f (xn) — f (%m)| < €. a 


Notice that f(x) = 1/x is continuous on (0,1) and x, = 1/n is Cauchy but 
f (4m) is not. In particular, 1/x is continuous but not uniformly continuous on 
the open interval (0,1). Notice how the graph of y = 1/x corroborates this fact. 
Indeed, as a gets closer to 0, the value of 5 gets smaller (compare 6, to 69 in 
Figure 3.4) and hence cannot be chosen independently of a. 


f(x) + EF — 


(xo) - € 


Fx) +e 


flix) -€ 


5y Yo 6, 71 


FIGURE 3.4 
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Thus on an open interval, continuity and uniform continuity are different, 
even if the interval is bounded. The following result shows that this is not the 
case for closed, bounded intervals. (This result is extremely important because 
uniform continuity is so strong. Indeed, we shall use it dozens of times before 
this book is finished.) 


3.39 Theorem. Suppose that I is a closed, bounded interval. If f : I > Ris 
continuous on I, then f is uniformly continuous on I. 


Proof. Suppose to the contrary that f is continuous but not uniformly con- 
tinuous on J. Then there is an ¢€9 > O and points x,, y, € J such that 
IXn — Yn| < 1/n and 


If (Xn) — fOn)| = £0, neN. (10) 


By the Bolzano—Weierstrass Theorem and the Comparison Theorem, the 
sequence {x,} has a subsequence, say xn,, which converges, as k — oo, to 
some x € J. Similarly, the sequence {yn,}xen has a convergent subsequence, 
Say Yng;> which converges, as j — oo, to some y € J. Since Xnj, > X as 


j ~ cand f is continuous, it follows from (10) that | f(x) — f (y)| > €0; that 
is, f(x) # f(y). But |x, —y,| < 1/n for alln € Nso Theorem 2.9 (the Squeeze 
Theorem) implies x = y. Therefore, f(x) = f(y), a contradiction. a 


Our first application of this result is a useful but simple characterization of 
uniform continuity on bounded open intervals. (This result does NOT work for 
unbounded intervals.) 


3.40 Theorem. Suppose thata < band that f : (a,b) > R. Then f is uniformly 
continuous on (a, b) if and only if f can be continuously extended to [a, b]; that is, 
if and only if there is a continuous function g : [a,b] > R which satisfies 


f@)=ex), x (a,b). (11) 


Proof. Suppose that f is uniformly continuous on (a, b). Let x, € (a, b) con- 
verge to bas n — oo. Then {x,} is Cauchy; hence, by Lemma 3.38, so is 
{ f (%,)}. In particular, 


g(b) = him fn) 


exists. This value does not change if we use a different sequence to approxi- 
mate b. Indeed, let y, € (a, b) be another sequence which converges to b as 
n — oo. Given ¢ > 0, choose 6 > 0 such that (9) holds for E = (a, b). Since 
Xn — Yn — 0, choose N € N so that n > N implies |x, — y,| < 6. By (9), 
then, | fn) — fOn)| < ¢ for alln > N. Taking the limit of this inequality as 
n — oo, we obtain 


| lim f(%n)— lim f(yn) | sé 
noo noo 
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for all ¢ > 0. It follows from Theorem 1.9 that 
lim f(xn) = lim f(y). 
n—->Co n—->Co 


Thus, g(b) is well defined. A similar argument defines g(a). 

Set g(x) = f(x) for x € (a, b). Then g is defined on [a, 5], satisfies (11), and 
is continuous on [a, b] by the Sequential Characterization of Limits. Thus, f 
can be “continuously extended” to g as required. 

Conversely, suppose that there is a function g continuous on [a, b] which 
satisfies (11). By Theorem 3.39, g is uniformly continuous on [a, b]; hence, g 
is uniformly continuous on (a, b). We conclude that f is uniformly continuous 
on (a, D). | 


Let f be continuous on a bounded, open, nondegenerate interval (a, b). 
Notice that f is continuously extendable to [a, b] if and only if the one-sided 
limits of f exist at a and b. Indeed, when they exist, we can always define g at 
a and b to be the values of these limits. In particular, we can prove that f is 
uniformly continuous without using e’s and 6’s. 


3.41 EXAMPLE. 


Prove that f(x) = (x — 1)/logx is uniformly continuous on (0,1). 


Proof. It is clear that f(x) > 0 as x > 0+. Moreover, by I’H6pital’s Rule 
(see Theorem 4.27), 


1 
li = lim — = 1. 
eae FR) gap 1/x 


Hence f is continuously extendable to [0,1], so by Theorem 3.40, f is uni- 
formly continuous on (0,1). | 


EXERCISES 


3.4.0. Decide which of the following statements are true and which are false. 
Prove the true ones and provide counterexamples for the false ones. 


a) If f is uniformly continuous on (0, oo) and g is positive and bounded 
on (0, oo), then fg is uniformly continuous on (0, 00). 

b) The function x log(1/x) is uniformly continuous on (0,1). 

c) The function 


cos x 
mx +b 
is uniformly continuous on (0,1) for all nonzero m, b ER. 


d) If fg are uniformly continuous on an interval [a,b] and g(x) 4 0 for 
x € [a, b], then f/g is uniformly continuous on [a, b]. 
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3.4.1. 


3.4.2. 


3.4.3. 


3.4.4. 


3.4.5. 


3.4.6. 


Functions on R 


Using Definition 3.35, prove that each of the following functions is uni- 
formly continuous on (0,1). 


a) f(x) =x? 45x 
b) f(x) =x? +5x41 
c) f(x) =x? sin2x —2 


Prove that each of the following functions is uniformly continuous on 
(0,1). (You may use L’H6pital’s Rule and assume that sin x and log x are 
continuous on their domains.) 


cosx — l 


a) f(x)= 


x 
b) f(x) =x sin(x~7) 


c) f(x) = 2log(x*) 
] Xx 

d) f(x) = (1 + -) 
x 


Assuming that cosx is continuous on R, find all real @ such that 

x cos(1/x) is uniformly continuous on the open interval (0,1). 

a) Suppose that f : [0, co) > Ris continuous and that there isan L ¢ R 
such that f(x) > L asx — ov. Prove that f is uniformly continuous 
on [0, oo). 

b) Prove that f(x) = 1/ (x? + 1) is uniformly continuous on R. 


Suppose that w € R, that E isa nonempty subset of R, and that f, g : E > 
R are uniformly continuous on E. 


a) Prove that f + g andaf are uniformly continuous on E. 

b) Suppose that fg are bounded on E. Prove that fg is uniformly contin- 
uous on E. 

c) Show that there exist functions fg uniformly continuous on R such 
that fg is not uniformly continuous on R. 

d) Suppose that f is bounded on E and that there is a positive constant eo 
such that g(x) > ¢o for allx € E. Prove that f/g is uniformly continuous 
on E. 

e) Show that there exist functions fg, uniformly continuous on the inter- 
val (0,1), with g(x) > 0 for all x € (0, 1), such that f/g is not uniformly 
continuous on (0,1). 


a) Let J be a bounded interval. Prove that if f : 7 > R is uniformly 
continuous on /, then f is bounded on /. 

b) Prove that a) may be false if J is unbounded or if f is merely 
continuous. 


3.4.7. 


3.4.8. 


3.4.9. 
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Suppose that f is continuous on [a, b]. Prove that given ¢ > 0 there exist 
points x9 = a < x1 <--- < x, = b such that if Ey := {y : f(x) = y for 
some x € [xz_-1, xg]}, then sup EF, — inf FE, < e fork =1,2,...,n. 

Let E CR. A function f : E —> Ris said to be increasing on E if and 
only if x1,x2 € E and x; < x2 imply f(x) < f(x2). Suppose that f is 
increasing and bounded on an open, bounded, nonempty interval (a, b). 


a) Prove that f(a+) and f(b—) both exist and are finite. 

b) Prove that f is continuous on (a, b) if and only if f is uniformly con- 
tinuous on (a, b). 

c) Show that b) is false if f is unbounded. Indeed, find an increasing 
function g : (0, 1) ~ R which is continuous on (0,1) but not uniformly 
continuous on (0,1). 


Prove that a polynomial of degree n is uniformly continuous on R if and 
only ifn = 0or 1. 


CHAPTER 4 


Differentiability on R 


4.1 THE DERIVATIVE 


For many applications, we need to compute the slope of a tangent line to a curve. 
The following concept is useful in this regard. 


4.1 Definition. 


A real function f is said to be differentiable at a point a € Rif and only if f is 
defined on some open interval J containing a and 


jj 2 ee 


exists. In this case f’(a) is called the derivative of f at a. 


The assumption that f be defined on an open interval containing a is made so 
that the quotients in (1) are defined for all h 4 0 sufficiently small. 

You may recall that the graph of y = f(x) has a non-vertical tangent line at 
the point (a, f(a)) if and only if f has a derivative at a, in which case the slope 
of that tangent line is f’(a). To see why this connection makes sense, let us 
consider a geometric interpretation of (1). Suppose that f is differentiable at a. 
A secant line of the graph y = f(x) is a line passing through at least two points 
on the graph, and a chord is a line segment which runs from one point on the 
graph to another. Let x = a+h, and observe that the slope of the chord passing 
through the points (x, f(x)) and (a, f(a)) is given by (f(x) — f(a))/(x—a). Now, 
since x =a +h, (1) becomes 


Ay tOat@: 

xa x-—a 
Hence, as x — a the slopes of the chords through (x, f(x)) and (a, f(a)) 
approximate the slope of the tangent line of y = f(x) at x = a (see Figure 4.1), 
and in the limit, the slope of the tangent line to y = f(x) at x = a is precisely 
f'(@. Thus, we shall say that the graph of y = f(x) has a unique tangent line at 
a point (a, f(a)) if and only if f’(a) exists. 


112 
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y=f@) 


Tangent 


#Y 


FIGURE 4.1 


If f is differentiable at each point in a set E, then f’ is a function on E. This 
function is denoted several ways: 


d 
Dy f = a = f= fi 


When y = f(x), we shall also use the notation dy/dx or y’ for f’. Higher-order 
derivatives are defined recursively; that is, ifn ¢ N, then f @+D(q) := (f™Y'(a), 
provided these derivatives exist. Higher-order derivatives are also denoted 
several ways, including D” f, d" f/dx", f™, and by d"y/dx" and y™ when 
y = f(x). The second derivatives f (respectively, y°) are usually written 
as f” (respectively, y”), and when they exist at some point a, we shall say that f 
is twice differentiable at a. 

Here are two characterizations of differentiability which we shall use to study 
derivatives. The first one, which characterizes the derivative in terms of the 
“chord function” 

F(x) := Fx) — #@ x #a, (2) 
x-a 
will be used to establish the Chain Rule in Section 4.2. 


4.2 Theorem. A real function f is differentiable at some point a € R if and only 
if there exist an open interval I and a function F : I + R such thata € I, f is 
defined on I, F is continuous at a, and 


f(x) = F@)@ —a)+ f@ (3) 
holds for all x € I, in which case F(a) = f'(a). 
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Proof. Notice once and for all that for x € J \ {a}, (2) and (3) are equivalent. 
Suppose that f is differentiable at a. Then f is defined on some open interval 
I containing a, and the limit in (1) exists. Define F on J by (2) if x # a, and 
by F(a) := f’(a). Then (3) holds for all x € 7, and F is continuous at a by (2) 
since f’(a) exists. 

Conversely, if (3) holds, then (2) holds for all x € J, x #4 a. Taking the limit 
of (2) as x — a, bearing in mind that F is continuous at a, we conclude that 
F(a) = f'(a). | 


The second characterization of differentiability, in terms of linear approxi- 
mations |[i.e., how well f(a + h) — f(a) can be approximated by a straight line 
through the origin] will be used in Chapter 11 to define the derivative of a func- 
tion of several variables. 


4.3 Theorem. A real function f is differentiable at a if and only if there is a 
function T of the form T(x) := mx such that 


om LE tM—f@)—Th) _ 
im = 
h->0 h 


0. (4) 


Proof. Suppose that f is differentiable, and set m := f’(a). Then by (1), 


fath)—-f@—-Th)_ fat+h)—f@ 
h - h 


f'@—>0 


ash — 0. 
Conversely, if (4) holds for T(x) := mx andh 4 0, then 


flath)—fl@) _ fi@tPh) = f(@=mh 
h aes h 
Jey 7 a= 2p) 
- : 


=m+ 


By (4), the limit of this last expression is m. It follows that (f(a + h) — 
f(a))/h > m, as h — 0; that is, that f’(a) exists and equals m. a 


Our first application of Theorem 4.2 answers the question: Are differentiabil- 
ity and continuity related? 


4.4 Theorem. [f fis differentiable at a, then f is continuous at a. 


Proof. Suppose that f is differentiable at a. By Theorem 4.2, there is an open 
interval J and a function F, continuous at a, such that f(x) = f(a) + F(x) 
(x — a) for all x € J. Taking the limit of this last expression as x — a, we 
see that 


lim f(x) = f@) + F@)-0= faa). 


In particular, f(x) — f(a) as x — a; that is, f is continuous at a. a 
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Thus any function which fails to be continuous at a cannot be differentiable 
at a. The following example shows that the converse of Theorem 4.4 is false. 


4.55 EXAMPLE. 


Show that f(x) = |x| is continuous at 0 but not differentiable there. 


Proof. Since x — O implies |x| > 0, f is continuous at 0. On the other hand, 


since |h| = h when h > 0 and |A| = —h when h < 0, we have 
h)— h) — 
lim FH) ~ fO) = FO) =1 and lim Fe) ~ FO) =-l. 
h->0+ h h>0- h 


Since a limit exists if and only if its one-sided limits exist and are equal 
(Theorem 3.14), it follows that the limit in (1) does not exist when a = 0 
and f(x) = |x|. Therefore, f is not differentiable at 0. a 


This example reflects the conventional wisdom about the difference between 
differentiable and continuous functions. Since a function differentiable at a 
always has a unique tangent line at (a, f(a)), the graph of a differentiable 
function on an interval is “smooth” with no corners, cusps, or kinks. On the con- 
trary, although the graph of a continuous function on an interval is unbroken 
(has no holes or jumps), it may well have corners, cusps, or kinks. In particular, 
f(x) = |x| is continuous but not differentiable at x = 0 and the graph of y = |x| 
is unbroken but has a corner at the point (0, 0) (see Figure 4.2). 


Ay 


HY 


FIGURE 4.2 


By Definition 4.1, if f is differentiable at a, then f must be defined on an 
open interval containing a (i.e., on both sides of a). As with the theory of limits, 
it is convenient to define “one-sided” derivatives to deal with functions whose 
domains are closed intervals (see Example 4.7 below). Here is a brief discus- 
sion of what it means for a real function to be differentiable on an interval (as 
opposed to being differentiable at every point in an interval). This concept will 
be used in Sections 5.3, 5.6, and 11.1. 
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4.6 Definition. 


Let J be a nondegenerate interval. 
i) A function f : J > Ris said to be differentiable on J if and only if 


att TO 
es > eee 


exists and is finite for every a € I. 
ii) f is said to be continuously differentiable on I if and only if f; exists and is 
continuous on /. 


Notice that when a is not an endpoint of J, f;(a) is the same as f’(a). Because 
of this, we usually drop the subscript on f}. In particular, if f is differentiable 
on [a, b], then 


lath) —- f@ 
h 


fb+h)— fb) 
; ; 


‘(a):= li d f'(b):= li 
Fa a 


The following example shows that Definition 4.6 enlarges the collection of 
differentiable functions. 


4.7 EXAMPLE. 


The function f(x) = x*/? is differentiable on [0, 00) and f’(x) = 3./x/2 for all 
x € [0, co). 


Proof. By the Power Rule (see Exercise 4.2.7), f(x) = 3./x/2 for all x € 
(0, co). And by definition, 


2-9 
f ©) = lim = lim Jh=0. a 
h—-0+ h h->0+ 


Here is notation widely used in conjunction with Definition 4.6. Let J be 
a nondegenerate interval. For each n € N, define the collection of functions 
C"(I) by 


C"(1) :={f : f: 1 > Rand f™ exists and is continuous on /}. 


We shall denote the collection of f which belong to C” (J) for alln ¢ NbyC™ (J). 
Notice that C!(/) is precisely the collection of real functions which are continu- 
ously differentiable on J. When dealing with specific intervals, we shall drop the 
outer set of parentheses; for example, we shall write C”[a, b] for C” ([a, b]). 

By modifying the proof of Theorem 4.4, we can show that if f is differentiable 
on J, then f is continuous on J. Thus, C°(1) c C™() c C" (J) for all integers 
m>n> 0. 
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The following example shows that not every function which is differentiable 
on R belongs to C!(R). 


4.8 EXAMPLE. 
The function 


x* sin(1/x) x £0 


reo=|5 ps0 


is differentiable on R but not continuously differentiable on any interval which 
contains the origin. 


Proof. By definition, 


f’(O) = lim hsin (;) =0 and f’(x) = 2xsin (~) — cos (~) 
h>0 h x x 


for x #£ 0. Thus f is differentiable on R but lim,_,9 f’(x) does not exist. In 
particular, f’ is not continuous on any interval which contains the origin. Mf 


It is important to notice that a function which is differentiable on two sets is 
not necessarily differentiable on their union. 


4.9 Remark. f(x) = |x| is differentiable on [0, 1] and on [—1, 0] but not on 
[—1, 1]. 


Proof. Since f(x) = x when x > 0 and = —x when x < 0, it is clear that f is 
differentiable on [—1, 0) U (0, 1] [with f’(x) = 1 for x > O and f’(x) = —1 for 
x <0]. By Example 4.5, f is not differentiable at x = 0. However, 


lhl _ (hl 
‘ = a f = eS 
foo. = lim S> = 1 and f[-1,9) = lim >> = —1. 


Therefore, f is differentiable on [0, 1] and on [—1, 0]. | 


EXERCISES 


4.1.0. Suppose that f, g : [a,b] — R. Decide which of the following state- 
ments are true and which are false. Prove the true ones and provide 
counterexamples for the false ones. 


a) If f = g’ and f is differentiable on [a, b], then g is differentiable on 
(a, b). 

b) If f is differentiable on [a,b], then f is uniformly continuous on 
[a, b]. 
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4.1.1. 


4.1.2 


4.1.3. 


4.1.4. 


4.1.5. 


4.1.6. 


4.1.7. 


Differentiability on R 


c) If f is differentiable on (a,b) and f(a) = f(b) = 0, then f is uni- 
formly continuous on [a, b]. 

d) If f is differentiable on (a, b] and f(x)/(x —a) > lasx > a4, then 
f is uniformly continuous on (a, db]. 


For each of the following real functions, use Definition 4.1 directly to 
prove that f’(a) exists. 


a) f(x) =x7+5x-6,aER 
b) f@) =Vx4+1,a>-1 
c) f(x) =1/@—-1, a #1 


. This exercise is used in Section 4.2. 


a) Prove that (x”)! = nx"~! for every n € Nand every x ER. 
b) Prove that (x”)’ = nx"—! for every n € —NU{0} and every x € (0, 00). 


Suppose that 


1 

|x| sin — x #0 
x 

x= 0. 


Show that f,(x) is continuous at x = 0 whena > 0 and differentiable at 
x = 0 when a > 1. Graph these functions for a = 1 and a = 2 and give 
a geometric interpretation of your results. 

Let J be an open interval which contains 0 and f : J > R. If there 
exists ana > 1 such that | f(x)| < |x|® for all x € J, prove that f is 
differentiable at 0. What happens when a = 1? 


fa(x) = 


a) Find all points (a, b) on the curve C, given by y = x — 2cosx, so that 
the tangent lines to C at (a, b) are parallel to the line y = x — 2. 

b) Find all points (a, b) on the curve C, given by y = 5x” — 2x, so that 
the tangent lines to C at (a, b) pass through the point (0, —5). 


Define f on R by 


3x3+1 x >0 
LES 0 x <0. 
Find all n € N such that f exists on all of R. 
Suppose that f : (0,00) — R satisfies f(x) — f(y) = f(x/y) for all 
x,y € (0, co) and f(1) = 0. 


a) Prove that f is continuous on (0, oo) if and only if f is continuous 
at 1. 

b) Prove that f is differentiable on (0, oo) if and only if f is differen- 
tiable at 1. 

c) Prove that if f is differentiable at 1, then f’(x) = f’(1)/x for all 
x € (0, cc). 


[Note: If f’(1) = 1, then f(x) = log x.] 
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4.1.8. Let J be an open interval, f : 7 — R, andc ¢€ I. The function f is said 
to have a local maximum at c if and only if there is a 6 > 0 such that 
f(c) = f(x) holds for all |x — c| < 6. 


a) If f has a local maximum at c, prove that 


fle+t) — fc) = 
t 


f(e+u)— fle) ay) and 
u 


for u > 0 andt < 0 sufficiently small. 

b) If f is differentiable at c and has a local maximum at c, prove that 
7 (ce) = 0, 

c) Make and prove analogous statements for local minima. 

d) Show by example that the converses of the statements in parts b) and 
c) are false. Namely, find an f such that f’(0) = 0 but f has neither 
a local maximum nor a local minimum at 0. 


4.1.9. Suppose that 7 = (—a, a) for some a > 0. A function f : J > Ris said 
to be even if and only if f(—x) = f(x) for all x € J, and said to be odd 
if and only if f(—x) = —f(x) forall x e€/. 


a) Prove that if f is odd and differentiable on /, then f’ is even on /. 
b) Prove that if f is even and differentiable on /, then f’ is odd on /. 
4.2 DIFFERENTIABILITY THEOREMS 


In this section we prove several familiar results about derivatives. 


4.10 Theorem. Let f and g be real functions anda € R. If f and g are differen- 
tiable at a, then f + g, af, f -g, and [when g(a) # 0] f/g are all differentiable 
at a. In fact, 


(f+9)@= f'(a)4+ 8’), (5) 

(af)'(a) =af'(a), (6) 

(fg) (a =s@f (a+ f@g'(a), (7) 

( ) (q) = SOLO) = f@s'@) (@) 
8 g(a) 


Proof. The proofs of these rules are similar. We provide the details only 
for (7). By adding and subtracting f(a)g(x) in the numerator of the left side 
of the following expression, we can write 


fO)8Q) = fOs@ _ ey Le + fae = 8@): 


x—a md 
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This last expression is a product of functions. Since g is continuous (see The- 
orem 4.4), it follows from Definition 4.1 and Theorem 3.8 that 


li f(x)g(x) — f(a)g(a) 
im 


x7a X—a 


= g(a) f'(a) + f(a)g'(a). al 


Formula (5) is called the Sum Rule, (6) is sometimes called the Homogeneous 
Rule, (7) is called the Product Rule, and (8) is called the Quotient Rule. 
Next, we show what the derivative does to a composition of two functions. 


4.11 Theorem. [CHAIN RULE]. 
Let f and g be real functions. If f is differentiable at a and g is differentiable at 
f (a), then g o f is differentiable at a with 


(go f)'(@) =8'(f@)f'@. (9) 


Proof. By Theorem 4.2, there exist open intervals / and J, and functions F : 
I — R, continuous at a, and G : J — R, continuous at f(a), such that 


F(a) = f'(a), G(f@) = 8'(f@), 
f(x) = Fx) — a) + fla), xel, (10) 


and 


g(y) =GO)(y — f@) +8(f@), yeJ. (11) 


Since f is continuous at a, we may assume (by making / smaller if necessary) 
that f(x) € J forall x e€ J. 
Fix x € J. Apply (11) to y = f(x) and (10) to x to write 


(g 0 f)@) = g(f(x)) = GF @))(F() — F(@) + 8(f@) 
= G(f(x))FO)@ — a) + (g 0 f(a). 


Set H(x) = G(f(x))F(x) for x € I. Since F is continuous at a and G is 
continuous at f(a), it is clear that H is continuous at a. Moreover, 


H(a) =G(f@)F@=s(f@f@. 
It follows from Theorem 4.2, therefore, that (go f)'(a) = e'(f(a@)) f(a). Hf 


EXERCISES 


4.2.0. Suppose that J is an open interval containing a, and that f,g,h:I > R. 
Decide which of the following statements are true and which are false. 
Prove the true ones and provide counterexamples for the false ones. 


a) If f g, and /h are differentiable at a, then 


(fgh)'(a) = f'(a)g(ayh(a) + f(a)g'(ah(a) + flag ayh'(a). 


4.2.1. 


4.2.2. 


4.2.3. 


4.2.4. 
4.2.5. 
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b) If f is twice differentiable at a and g is twice differentiable at f(a), 
then 


(go f)"@=s(f@Sf'@+s"(f@(f'@). 
c) If the nth-order derivatives f (a) and g(a) exist, then 
Sto? @ =f" @+e°O: 


d) If the nth-order derivatives f(a) and g(a) exist and are 
nonzero, then 


en @) = LOLOM + CD" FOR) 
g - gttl(a) 


Suppose that f and g are differentiable at 0 and 1 with f’(0) =a, 
f') = 5b, 9/0) =c, and g’/(1) =d. If f() =0, fd) =2, g(0) = —5, 
and g(1) = 1, evaluate each of the following derivatives. 


a) (fg)'(1) 

b) (f/8)'(0) 
c) (go f)'(0) 
d) (fog) 


Suppose that f is differentiable at 3 and 9 with f(3) =2, f(9) = 1, 
f'B) =e, and f’(9) =z. 


a) If g(x) = x f(x”), find the value of g’(3). 
b) If g(x) = f7(,/%), find the value of g’(9). 
c) If g(x) = x/B3f(X/s)), find the value of g’(27). 


[PowER RuLE] Assume that (e*)’ = e* for x € R and (logx)’ = 1/x for 
x > 0. Use x® := e*!8* to prove that (x%)’ = ax%! for all x > 0 and 
alla ER. 

Using Exercise 4.1.2, prove that every polynomial belongs to C™®(R). 
Suppose that f is differentiable at a and f(a) £0. 


a) Show that for / sufficiently small, f(a +h) 40. 
b) [ReEcrIpROcAL RULE] Using Definition 4.1 directly, prove that 1/f (x) 
is differentiable at x = a and 


LY. f'@ 
Bg) aay 
f f*(@) 
c) Use the Product Rule and the Reciprocal Rule to prove the Quotient 
Rule directly. 
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4.2.6. Suppose that n € N and f,g are real functions of a real variable 
whose nth derivatives f™, g exist at a point a. Prove Leibniz’s 
generalization of the Product Rule: 


(f9@=)>> @ fO@s%?@). 


k=0 


4.2.7|. This exercise is used in Section 5.3. 


a) Prove that ifg =n/m forn € Zandm €N, then 
x” — aq” = (x7 — at)(x1"—) ne x1(m—2) sf Sate xqi m2) +4 qi) 


for every x,a € (0,00). 
b) [Power RuLE] Use Exercise 4.1.2 and part a) to prove that x? is 
differentiable on (0, oo) for every g € Q and that (x4)! = qx? = 


4.2.8. Assuming that e* is differentiable on R, prove that 


xX 


{Wes Leen ie 
0 


x=0 


is differentiable on [0, oo). Is f differentiable at 0? 


4.2.9. Using elementary geometry and the definition of sinx,cosx, we can 
show that for every x, y € R (see Appendix B) 
i) |sinx| < 1, |cosx| < 1, sin(O) = 0, cos(0) = 1, 
ii) sin(—x) = —sinx, cos(—x) = cos x, 
iii) sin? x + cos? x = 1, cosx = 1 —2sin? (5) ; 


iv) sin(x + y) = sinx cos y +cosx sin y. 


Moreover, if x is measured in radians, then 


v) cosx = sin(} — x), sinx = cos (4% — x), 
and 

. ; 4 

vi) 0 < xcosx < sinx <x, Vers: 


Using these properties, prove each of the following statements. 


a) The functions sinx and cosx are continuous at 0. 
b) The functions sinx and cosx are continuous on R. 
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c) The limits 


exist. 
d) The function sin x is differentiable on R with (sin x)’ = cos x. 
e) The functions cosx and tanx := sinx/cos.x are differentiable on R 


with (cos x)’ = — sinx and (tanx)! = sec? x. 


4.3 THE MEAN VALUE THEOREM 


The Mean Value Theorem makes a precise statement about the relationship 
between the derivative of a function and the slope of one of its chords. It 
was discovered by the following geometric reasoning. Suppose that f is 
differentiable on (a,b). Since the graph of f on (a,b) has a tangent at each 
of its points, it seems likely that the slope of the chord through the points 
(a, f(a)) and (b, f(b)) equals the slope f’(c) for some value of c € (a,b) 


(see Figure 4.3). 
AY 
Tangent 
Chord 
y=f@) 
| 
| 
| 
| 
| > 
a c b x 


FIGURE 4.3 


We begin with a special case. 


4.12 Lemma. [ROLLE’S THEOREM]. 
Suppose that a,b € Rwitha < b. If f is continuous on [a, b], differentiable on 
(a, b), and if f(a) = f(b), then f'(c) = 0 for some c € (a, b). 
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Proof. By the Extreme Value Theorem, f has a finite maximum M and a 
finite minimum m on [a, b]. If M =m, then f is constant on (a, b) and f’(x) = 
0 for all x € (a, b). 

Suppose that M 4 m. Since f(a) = f(b), f must assume one of the values 
M orm at some point c € (a, b). By symmetry, we may suppose that f(c) = 
M. [That is, if we can prove the theorem when f(c) = M, then a similar proof 
establishes the theorem when f(c) = m.] Since M is the maximum of f on 
[a, b], we have 


fie+h)— fc) <0 
for all h which satisfy c + h € (a, b). In the case h > 0 this implies 


Hijet LOEDETO s 


0, 
h>0+ h 


and in the case h < 0 this implies 


Meena so > 0, 


f = li 
Fhe) im 
It follows that f’(c) = 0. | 
Notice once and for all that the proof of Rolle’s Theorem proves a well-known 
result: The extreme values of a differentiable function on an open interval occur 


at critical points (i.e., at points where f’ is zero). 


4.13 Remark. The continuity hypothesis in Rolle’s Theorem cannot be relaxed 
at even one point in [a, b]. 


Proof. The function 


4G) = i x € [0, 1) 


x=1 


is continuous on [0, 1), differentiable on (0,1), and f(0) = f(1) = 0, but f’(x) 
is never zero. | 


4.14 Remark. The differentiability hypothesis in Rolle’s Theorem cannot be 
relaxed at even one point in (a, b). 


Proof. The function f(x) = |x| is continuous on [—1, 1], differentiable on 
(—1, 1) \ {0}, and f(—1) = f(J), but f’(x) is never zero. a 


We shall use Rolle’s Theorem to obtain several useful results. The first is a 
pair of “Mean Value Theorems.” 
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4.15 Theorem. Suppose that a,b € Rwitha < b. 


i) [GENERALIZED MEAN VALUE THEOREM] If f, g are continuous on [a, b] and 
differentiable on (a, b), then there is ac € (a, b) such that 


ge (c)(f(b) — fla) = f’()(g(b) — g(a). 


ii) [MEAN VALUE THEOREM] If f is continuous on [a,b] and differentiable on 
(a, b), then there is ac € (a, b) such that 


f(b) — fa) = f(b a). 


Proof. i) Set h(x) = f(x)(g(b) — g(a)) — g(x)(f() — f(a)). Since h(x) = 
f'(x)(g(b) — g(a)) — 9'(x)(f(b) — f(@), it is clear that h is continuous on 
[a, b], differentiable on (a, b), and h(a) = h(b). Thus, by Rolle’s Theorem, 
h'(c) = 0 for some c € (a, b). 

ii) Set g(x) = x and apply part i). (For a geometric interpretation of this 
result, see the opening paragraph of this section and Figure 4.3.) | 


The Generalized Mean Value Theorem is also called Cauchy’s Mean Value 
Theorem. It is usually essential when comparing derivatives of two functions 
simultaneously, using higher-order derivatives to approximate functions, and 
studying certain kinds of generalized derivatives (e.g., see Taylor’s Formula and 
V’H6pital’s Rule in the next section, and Remark 14.32). 

The Mean Value Theorem is most often used to extract information about 
f from f’ (see, e.g., Exercises 4.3.4, 4.3.5, and 4.3.9). Perhaps the best known 
result of this type is the criterion for deciding when a differentiable function 
increases. To prove this result, we begin with the following nomenclature. 


4.16 Definition. 
Let E be anonempty subset of Rand f: E > R. 


i) f is said to be increasing (respectively, strictly increasing) on E if and only 
ifx1,x2 € Eandx; < x.imply f(x1) < f (x2) [respectively, f(x1) < f(x2)]. 
ii) f is said to be decreasing (respectively, strictly decreasing) on E if and only 
ifx1, x2 € Eandx, < x.imply f(x) = f (x2) [respectively, f(x1) > f(x2)]. 
iii) f is said to be monotone (respectively, strictly monotone) on E if and only if 
f is either decreasing or increasing (respectively, either strictly decreasing 

or strictly increasing) on E. 


Thus, although f(x) = x? is strictly monotone on [0, 1], and on [—1, 0], it is 
not monotone on [—1, 1]. 

Monotone functions are important from both a theoretical and a practical 
point of view (e.g., see Theorem 5.34). Thus it will come as no surprise that the 
following result is very important and widely used. 
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4.17 Theorem. Suppose thata,b € R, witha < b, that f is continuous on [a, b], 
and that f is differentiable on (a, b). 


i) If f’(x) > 0 [respectively, f'(x) < 0] for all x € (a,b), then f is strictly 
increasing (respectively, strictly decreasing) on [a, b]. 
ii) If f(x) =0 for all x € (a, b), then f is constant on [a, b}. 
iti) If g is continuous on [a, b] and differentiable on (a, b), and if f'(x) = g'(x) 
for all x € (a, b), then f—g is constant on [a, b]. 


Proof. Leta < x1 < x2 < b. By the Mean Value Theorem, there is ac € (a, b) 
such that f (x2) — f(x1) = f’(c)(x2 — x1). Thus, f(x2) > f (x1) when f’(c) > 0 
and f(x2) < f(x1) when f’(c) < 0. This proves part i). 

To prove part ii), notice that if f’ = 0, then by the proof of part i), f is 
both increasing and decreasing, and hence constant on [a, b]. Finally, part ii1) 
follows from part ii) applied to f—g. 


Theorem 4.17i is a great result. It makes checking a differentiable function 
for monotonicity a routine activity. However, there are many nondifferentiable 
functions which are monotone. For example, the greatest integer function, 


f(x) =[x] :=n, n<x<n+l1,neZ, 


is increasing on R but not even continuous, much less differentiable. 

How badly can these nondifferentiable, monotone functions behave? The 
following result shows that, just like the greatest integer function, any function 
which is monotone on an interval always has left and right limits (contrast with 
Examples 3.31 and 3.32). This is a function analogue of the Monotone Conver- 
gence Theorem. 


4.18 Theorem. Suppose that f is increasing on [a, b]. 


i) Ifc € [a, b), then f (c+) exists and f(c) < f (c+). 
ii) Ifc € (a, b], then f (c—) exists and f (c—) < f(c). 


Proof. By symmetry it suffices to show that f(c—) exists and satisfies f(c—) < 
f(c) for any fixed ce (a,b]. Set E = f((a,c)) ands = supE£. Since f is 
increasing, f(c) is an upper bound of E. Hence, s is a finite real number 
which satisfies s < f(c). Given ¢ > 0, choose by the Approximation Property 
an xo € (a,c) such that s —e < f(xo) <s. Since f is increasing, 


s—e<f(m) <fx) <s 
for all x9 < x < c. Therefore, f(c—) exists and satisfies f(c—) =s < f(c). 
We have seen (Example 3.32) that a function can be nowhere continuous 


(i.e., can have uncountably many points of discontinuity). How many points 
of discontinuity can a monotone function have? 
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*4.19 Theorem. If f is monotone on an interval I, then f has at most countably 
many points of discontinuity on I. 


Proof. Without loss of generality, we may suppose that f is increasing. Since 
the countable union of at most countable sets is at most countable (Theorem 
1.42ii), it suffices to show that the set of points of discontinuity of f can be 
written as a countable union of at most countable sets. Since R is the union of 
closed intervals [—n,n], n € N, we may suppose that J is a closed, bounded 
interval [a, b]. 

Let E represent the set of points of discontinuity of f on (a, b). By Theo- 
rem 4.18, f(x—) < f(x) < f(x4+) for all x € (a, b). Thus, f is discontinuous 
at such an x if and only if f(x+) — f(x—) > 0. It follows that 


oC 
E=|J4;. 
j=l 


where for each j € N, Aj := {x € (a,b): f(x+) — f(x—) = 1/j}. We will 
complete the proof by showing that each A; is finite. 

Suppose to the contrary that Aj) is infinite for some jo. Set yo := jo( f(b) — 
f (a)) and observe that since f is finite valued on /, yo is a finite real number. 
On the other hand, since Aj, is infinite, then by symmetry we may suppose 
that there exist xj < x2 < ...in [a,b] such that f(x,+) — f(x,—) = 1/jo for 
k EN. Since f is monotone, it follows that 


fb) — fa) = YF RH) — fe) = . 


k=1 


that is, yo = jo( f(b) — f(a) = n for alln € N. Taking the limit of this last 
inequality asm — oo, we see that yo = +00. With this contradiction, the proof 
of the theorem is complete. | 


Theorem 4.171 can be used for less mundane tasks than finding intervals on 
which a given function is increasing. The following example shows how to use it 
to compare one function with another. 


4.20 EXAMPLE. 
Prove that 1+ x <e* forallx > 0. 


Proof. Let f(x) = e* — x, and observe that f’(x) = e* — 1 > 0 for all x > 0. 
It follows from Theorem 4.17i that f(x) is strictly increasing on (0, co). Thus 
e*—x = f(x) > f(0) =1 for x > 0. In particular, e* > x + 1 for x > 0. _| 


We close this section with some optional results which further explore the 
mean value concept. 
Our first result shows how (1 + x)® is related to 1+ ax. 
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*4.21 Theorem. [BERNOULLI’S INEQUALITY]. 
Let a be a positive real number. If0 < a < 1, then (1+x)* < 1+ eax for all 
x € [-1, 0), and ifa > 1, then (1+ x)* > 1+ eax forall x € [-1, ox). 


Proof. The proofs of these inequalities are similar. We present the details 
only for the case 0 < a < 1. Fixx > —1 and let f(t) = t®, t € [0, oo). Since 
f'(t) = at®—!, it follows from the Mean Value Theorem (applied to a = 1 and 
b=1+ x) that 


fd+x) — fQ) =axct! (12) 


for some c between 1 and 1+ x. 
Case 1. x > 0. Thenc > 1. Since 0 < a < 1 implies a — 1 < 0, it follows 
that c?—! < 1, hence xc?! < x. Therefore, we have by (12) that 


(1+x)* = f(l+x) = fC) t+axc%"! < fl) tax =1+ax (13) 
as required. 


Case 2. —1 < x < 0. Thenc < 1 soc%~! > 1. But since x < 0, it follows that 
xc%—! < x as before and we can repeat (13) to obtain the same conclusion. Mf 


We will now use Bernoulli’s Inequality to show once again that vague reason- 
ing can produce wrong conclusions. To see why, assuming that 


1\” 
lim (1 + *) 
n—>0co n 


exists, what do you think its limit is? Vague reasoning that well over half your 
class would agree with: Since 1 + 1/n gets near 1 and 1% = 1 for alla € R, the 
limit should be 1, right? Absolutely not. 


*4.22 EXAMPLE. 


Prove that the sequence (1 + 1/n)” is increasing, as n — on, and its limit L 
satisfies 2 < L < 3. (The limit Z turns out to be an irrational number, the 
natural base e = 2.718281828459---.) 


Proof. The sequence (1 + 1/n)” is increasing, since by Bernoulli’s Inequality, 


p\7/@+) 1 
(8) (4) 
n n+1 


To prove that this sequence is bounded above, observe by the Binomial For- 


mula that 
1 n n a 1 k 
1+-}) = ae ae 
DG) 
k=0 
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a LY" n= Dive kay 1), A 2 1 
k}\n} nk kl — kL ~ 2k-1 


for allk EN. It follows from Exercise 1.4.4c that 
1 1\" eS it 1 


for n > 1. Hence, by the Monotone Convergence Theorem, the limit L exists 
and satisfies 2 < L < 3. | 


Now, 


The last result in this section shows that although a differentiable function 
might not be continuously differentiable, its derivative does satisfy an interme- 
diate value theorem. (This result is sometimes called Darboux’s Theorem.) 


*4.23 Theorem. [INTERMEDIATE VALUE THEOREM FOR 
DERIVATIVES]. 

Suppose that f is differentiable on [a,b] with f'(a) # f'(b). If yo is a real 
number which lies between f'(a) and f'(b), then there is an xq € (a,b) such 
that f'(xo) = Yo. 


STRATEGY: Let F(x) := f(x) — yox. We must find an xo € (a,b) such that 
F'(xo) := f’(xo) — yo = 0. Since local extrema of a differentiable function 
F occur only where the derivative of F is zero (e.g., see the proof of Rolle’s 
Theorem), it suffices to show that F has a local extremum at some x9 € (a, D). 


Proof. Suppose that yo lies between f’(a) and f’(b). By symmetry, we may 
suppose that f’(a) < yo < f(b). Set F(x) = f(x) — yox for x € [a,b], 
and observe that F is differentiable on [a, b]. Hence, by the Extreme Value 
Theorem, F has an absolute minimum, say F(x9), on [a,b]. Now F’(a) = 
f'@— yo < 0,so F(a+h) — F(a) < 0 forh > 0 sufficiently small. Hence 
F(a) is NOT the absolute minimum of F on [a, b]. Similarly, F(b) is not the 
absolute minimum of F on [a, b]. Hence, the absolute minimum F(xo9) must 
occur on (a, b); that is, x9 € (a, b) and F’(xo) = 0. | 


EXERCISES 


4.3.0. Suppose that f, g : [a,b] — R. Decide which of the following state- 
ments are true and which are false. Prove the true ones and provide 
counterexamples for the false ones. 


a) If f and g are increasing on [a, b], then f + g is increasing on [a, b]. 

b) If f and g are increasing on [a, b], then fg is increasing on [a, b]. 

c) If f is differentiable on (a, b) and lim,_,,+ f(x) exists and is finite, 
then for each x € (a, b) there is ac between a and x such that f(x) — 
f@h = f'O@—a). 

d) If f and g are differentiable on [a, b] and | f’(x)| < 1 < |g’(x)| for 
all x € (a, b), then | f(x) — f(a)| < |g(x) — g(q@)| for all x € [a, D]. 
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4.3.1. 


4.3.2. 


4.3.3. 


4.3.4. 


4.3.5. 


4.3.6. 


4.3.7. 


4.3.8. 


4.3.9. 


4.3.10. 


4.3.11. 


*4.3.12. 


Differentiability on R 


Prove that each of the following inequalities holds. 


a) 7x + 0.38 < e?* forall x > 1. 

b) logx < 2,/x — 1.99 for all x > 1. 
c) (sin? x — 1)/2 < |x| forall x ER. 
d) cosx + 1 < e* forall x > 1. 


Suppose that J = (—1, 1), that f is continuous at x = —1 and x = 1, 
and that f is differentiable on J. If f(—1) = 4 and f(1) =5S, prove that 
1/2e f'W). 

Let f be a real function and recall that anr € R is called a root of 
a function f if and only if f(r) = 0. Show that if f is differentiable 
on R, then its derivative f’ has at least one root between any two 
roots of f. 

Suppose that a < b are extended real numbers and that f is differen- 
tiable on (a,b). If f’ is bounded on (a, b), prove that f is uniformly 
continuous on (a, b). 

Suppose that f is differentiable on R. If f(0) = 1 and | f’(x)| < 1 for 
all x € R, prove that | f(x)| < |x| +1 forallx ER. 

Suppose that f is differentiable on (a,b), continuous on [a, b], and 
that f(a) = f(b) = 0. Prove that if f(c) 4 0 for some c € (a,b), 
then there exist x;, x2 € (a,b) such that f’(x;) is positive and f’(x2) is 
negative. 

Suppose that f is continuous on [a, b] and that 


F(x) := sup f ({a, x). 


Prove that F is continuous on [a, D]. 

Suppose that f is twice differentiable on (a, b) and that there are points 
x1 < x2 < x3in (a, b) such that f(x,) > f(x2) and f(x3) > f (x2). Prove 
that there is a point c € (a, b) such that f”(c) > 0. 

Suppose that f is differentiable on (0, oo). If L = limy..5 f’(x) and 
limy+oo f(n) both exist and are finite, prove that L = 0. 

Suppose that (a, b) is an open interval, that f : (a,b) — R is differ- 
entiable on (a, b), and that xo € (a, b) is a proper local maximum of f 
(see Exercise 4.1.8). 


a) Prove that given 5 > 0, there exist xj < x9 < x2 such that f’(x1) > 
0, f’(x2) < 0, and |x; — xo] < 6 for j = 1,2. 

b) Make and prove an analogous statement for a proper local mini- 
mum. 


Suppose that f : [a,b] — Ris continuous and increasing. Prove that 
sup f(E) = f (sup E) for every nonempty set E C [a, b]. 

Suppose that f is differentiable at every point in a closed, bounded 
interval [a, b]. Prove that if f’ is increasing on (a, b), then f’ is contin- 
uous on (a, D). 
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4.4 TAYLOR’S THEOREM AND I’HOPITAL'S RULE 


In this section we use the Generalized Mean Value Theorem to obtain informa- 
tion about approximation. 

To motivate the first result, notice by the Mean Value Theorem that if f is 
differentiable on (a, b), then for any points x, xo € (a, b), there is a c between x 
and xg such that 


f(x) = fo) + fC) — x0), x € (a,b). 


Thus we have precise information about how closely f(x) can be approximated 
by the constant function y = f(xo). Clearly, the values f(x) of a function 
whose graph bends at the point (xo, f(xo)) cannot be closely approximated by 
a constant function unless x is near x9. But a constant function is a polyno- 
mial of degree 0. If we used polynomials of higher degree (whose graphs do 
curve), might we be able to approximate f(x) even when x is not so close to x9? 
In fact, the next result contains precise information about how closely f(x) can 
be approximated by a certain polynomial of degree n. (To understand how Tay- 
lor discovered this result, see the proof of Theorem 7.39.) 


4.24 Theorem. [TAYLOR’S FORMULA]. 

Letn € N and let a, b be extended real numbers witha < b. If f : (a,b) > R, 
and if f@* exists on (a, b), then for each pair of points x, xo € (a, b) there is 
a number c between x and xo such that 


(n+1) 
‘i fe gag hie ONO achat, 


f(x) = fo) + ys ea 


Proof. Without loss of generality, suppose that x9 < x. Define 


yer! 


a t = = f@) k 
FO>= ar and G(t):= f(x) — f(t) dX a 8) 


for each t € (a,b), and observe that the theorem will be proved if we can 
show that there is a c between x and xo such that 


G(xo) = F(x) - fF). (14) 
This looks like a job for the Generalized Mean Value Theorem. 


To verify that F and G satisfy the hypotheses of the Generalized Mean 
Value Theorem, notice that 


ki e 


(k) (k+1) (k) 
a (2 Mo-nt)at Ope nige SOO ga. sie 
dt k! (k — 1)! 
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for t € (a, b) andk €N. Telescoping, we obtain 


ferrg 
7 n! 


G'(t) = (x —t)” 


for t € (a, b). On the other hand, by the Chain Rule 


(x — ty” 
n! 


F(t)=- 


fort € R. Thus F and G are differentiable on (xo, x), continuous on [xo, x], 
and satisfy 


G'(t) 


— (ntl) 
Fn) ~ f (t), tA#x. (15) 


By the Generalized Mean Value Theorem, there is a number c € (x0, x) 
such that 


(F(x) — F(x0))G"(c) = (G(x) — G(xo)) F'(c). (16) 


Since F(x) = G(x) = 0 and x F ec, it follows that —F(xo)G’(c) = 
—G(xo) F’(c); that is, G(xo) = F(x) - G’'(c)/F’(c). We conclude by (15) that 
(14) holds, as promised. be 


We shall use this result in Chapter 7 to show that most of the functions 
you’ve used in calculus classes before are very nearly polynomials themselves. 
To lay some ground work for these results, we introduce some additional 
notation. 

Define 0! = 1 and f(x) = f(x), and notice that f(x9) = f(xo)/0!. We 
shall call 


, ne ¢tk) 
B= LEO) (a= aiyh 


k) 
k=0 


the Taylor polynomial of order n generated by f centered at xo. Clearly, for each 
f € C™(a, b), Taylor’s Formula gives us an estimate of how well Taylor polyno- 
mials approximate /f. In fact, since Taylor’s Formula implies 


f*D (ce) 


_ pFxo 
|f(x) — Pa’ (x) <= @+D! 


(x —x)"t], (17) 


for some c between x and xo and the fraction 1/(n + 1)! gets smaller as n gets 
larger, we see that when the derivatives of f are bounded, the higher-order 
Taylor polynomials approximate f better than the lower-order ones do. 

Let’s look at two specific examples to see how this works out in practice. 
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4.25 EXAMPLE. 

Let f(x) =e* andneN. 

a) Find the Taylor polynomial P,, := Pf oe 
b) Prove that if x € [—1, 1], then 


le“ — Pa(x)| S 


(a+)! 
c) Find ann so large that P, approximates e* on [—1, 1] to four decimal places. 


Proof. a) Since f(x) = e* for all x € Randk = 0,1,..., it is clear that 
f )(Q) = 1 for all k > 0; that is, that 


P? (x) = "Te 


k=0 


(18) 


b) Let c,x € [—1, 1]. Clearly, |e°| < e! < 3 and |x"| < 1 foralln EN. 
But if c lies between x and 0, then c € [—1, 1]. Thus it follows from (17) that 
le* — Pa(x)| < |eex"*"|//@2+ 1! < 3/@+4 1). 

c) To get four-place accuracy, we want |e* — P,(x)| < .00005. By part b), 
this will hold when 3/(n + 1)! < 0.00005; that is, when (n + 1)! > 60, 000. 
According to my calculator, this occurs when n + 1 > 9, so setn = 8. a 


4.26 EXAMPLE. 
Let f(x) = sinx andn EN. 


a) Find the Taylor polynomial P2,41 := Poe 
b) Prove that if x € [—1, 1], then 


: 1 
|sinx — Pon41(x)| S Qn+D! 


c) Find ann so large that P2,41 approximates sin x on [—1, 1] to three decimal 
places. 


Proof. a) Observe that f(x) = sinx, f’(x) = cosx, f"(x) = —sinx, 
f(x) = —cosx, and f(x) = sinx, right back where we started from. 
Thus it is clear that f?%(«) = (—1)' sinx and f@+)(x) = (—1)* cos x for 
k = 0,1,.... It follows that f?"(0) = 0 and f+) (0) = (—1)* for k > 0; 
that is, that 


n Ee 1)kx2k+1 


PHO) = So (19) 
<= (2k +1)! 


b) Let c, x € [—1, 1]. Clearly, | f2”+?(c)| < 1 and |x2"+?| < 12"+? = 1 for all 
n EN. Thus it follows from (17) that | sinx — Poy41(x)| < 1/(Qn +4 2)!. 
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c) To get three-place accuracy, we want |sinx — Pon+2(x)| < .0005. By 
part b), this will hold when 1/(2n+2)! < 0.0005; that is, when (2n +2)! > 2000. 
According to my calculator, this occurs when 2n + 2 > 7, so set n = 3. a 


The next result is a widely known technique for evaluating limits of the form 
0/0 or co/oo. Since it involves using information about derivatives to draw con- 
clusions about the functions themselves, it should come as no surprise that the 
proof uses the Mean Value Theorem. (Notice that our statement is general 
enough to include one-sided limits and limits at infinity.) 


4.27 Theorem. [L’HOPITAL’S RULE]. 

Let a be an extended real number and I be an open interval which either con- 
tains a or has a as an endpoint. Suppose that f and g are differentiable on I\{a} 
and that g(x) 40 4 g'(x) forall x € I\{a}. Suppose further that 


A:= lim f(x) = lim g(x) 
pee ede 


is either 0 or oo. If 


/ 
x 
B:= lim a ) 
xa B(x) 
exists as an extended real number, then 


tim 2. = tim 2 


= Wl . 
xa g(x) = ¥ >a gl(x) 


Proof. Let x, € I be distinct points with x, —> a as k — oo such that either 
X¢ <a or x, >a for allk € N. By the Sequential Characterization of Limits 
and by the characterization of two-sided limits in terms of one-sided limits, it 
suffices to show that f(xz)/g(x,) > Bask > oo. 

We suppose for simplicity that B € R. (For the cases B = tov, see Exer- 
cise 4.4.10.) Notice once and for all, since g’ is never zero on J, that by Mean 
Value Theorem the differences g(x) — g(y) are never zero forx,y EJ, x #y, 
provided either x, y > a or x, y < a. Hence, we can divide by these differ- 
ences at will. 

Case 1. A=Oanda eR. Extend f and g to / U {a} by f(a) := 0 =: g(a). 
By hypothesis, f and g are continuous on / U {a} and differentiable on /\ {a}. 
Hence by the Generalized Mean Value Theorem, there is a cy between x, and 
y := a such that 


fand—fO) _ f'cx) 


= ; (20) 
B(x) — 80) 8"(ck) 
Since f(y) = g(y) = 0, it follows that 
fxk) — fOK)— Ff) _ fx) (21) 


exe) g(x) -— 80) (ck) 
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Let k — oo. Since c; lies between x; and a, cy; also converges to a as k > oo. 
Hence hypothesis and (21) imply f (xz)/g(x,) > Bask > oo. 

Case 2. A = +o0 anda € R. We suppose by symmetry that A = +o. For 
each k,n € N, apply the Generalized Mean Value Theorem to choose a cx.y 
between x; and x, such that (20) holds for x, in place of y and cx,» in place of 
cx. Thus 


Fon) _ FO) _ FO =FO0 _ 1 ge) — acy Lem 
Blin) Bn) (Xn) (Xn) : "8 (Chan) 


_ (1 ay) f' (k,n). 
(Xn) 8! (Ckn) , 


Fn) _ f%K) — gre) fF’ (ckn) | f'(Ck.n) 

&(Xn) 8 (Xn) &(Xn) 8 (Ckn) 8 (Ck.n) ; 
Since A = o, it is clear that 1/g(x,) ~ Oasn — o, and since cz,» lies 
between x; and x, it is also clear that cy, — a, ask,n — oo. Thus (22) 
and hypothesis should imply that f(x,)/g@,) ~ 0 -0+ B = B for large 
nandk. Specifically, let 0 < ¢ < 1. Since cy, > aask,n — ow, choose 
an No so large that n > No implies | f’(cNo,n)/g’(CNo,n) — Bl < &/3. Since 
(Xn) — ©, choose an N > No such that | f(xx,)/g(xn)| and the product 
|g (XN) /8(Xn)| - | f"(CNo,n)/8'(CNo,n)| are both less than ¢/3 for alln > N. It 
follows from (22) that for any n > N, 


that is, 


(22) 


7 Nga) 
&'(CNo,n) 


B 


=< €. 


fQn) _ B f no) 
g(Xn) ~ | g(%n) 


a f'(€No,n) 
B(Xn)  8"(CNo,n) 


Hence, f(%n)/g(%n) ~ Basn > oo. 

Case 3. a = +00. We suppose by symmetry that a = +00. Choose c > 0 
such that J D (c, oo). For each y € (0, 1/c), set d(y) = f(1/y) and w(y) = 
g(1/y). Notice that by the Chain Rule, 


oy) _ FAP -My) _ CLs) 
wy) g’A/y)(-1/y?)_— gy)" 
Thus, for x = 1/y € (c, 00), f’(x)/g/(x) = $'(y)/W’(y). Since x — oo if and 


only if y = 1/x — 0+, it follows that ¢ and yw satisfy the hypotheses of Cases 1 
or 2 fora = 0 and J = (0, 1/c). In particular, 


EG). nay o'(y) ian oY) _ lim f(x) 
x70 g/(x) yo 0+ W/(y) ~ y+ VO) x00 g(x) 


L’H6pital’s Rule can be used to compare the relative rates of growth of two 
functions. For example, the next result shows that as x > oo, e* converges to 
oo much faster than x” does. 
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4.28 EXAMPLE. 


Prove that limy_,.. x*/e* = 0. 


Proof. Since the limits of x*/e* and x/e* are of the form 00/00, we apply 
VH6pital’s Rule twice to verify 


2 


; _ 2x 
lim —= lim —= lim — =0. a 
x00 e* x00 e* x00 et 


For each subsequent application of |’H6pital’s Rule, it is important to check 
that the hypotheses still hold. For example, 


a ox 
lim 5 - = lim =041= lm ———. 
x>0 x- + sinx x0 2x + cosx x>0 2 —sinx 


Notice that the middle limit is not of the form 0/0. 
l’H6pital’s Rule can be used to evaluate limits of the form 0 - co = —0(—oo). 


4.29 EXAMPLE. 


Find lim,_.04 x log x. 


Solution. By writing x as 1/(1/x), we see that the limit in question is of the form 
co/oo. Hence, by I’H6pital’s Rule, 


I 1 
Meee Sci ee a 


x—>0+ x>0+ 1/x x3 0+ —1/x? 


The next two examples show that I’H6pital’s Rule can also be used to evaluate 
limits of the form 1° and 0°. 


4.30 EXAMPLE. 
Find L = limy.94(1 + 3x)!/*. 
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Solution. If the limit exists, then by a law of logarithms and the fact that log x 
is continuous, we have logL = lim,-.9;4 log(1 + 3x)/x. Thus it follows from 
PH6pital’s Rule and the Chain Rule that 


log +3 3/143 
logL = lim se A 3 ee es a) ae 
x—>0+ x x—>0+ 1 
In particular, the limit exists by l’Hdpital’s Rule and L = e!°8/ = e?. a 


4.31 EXAMPLE. 


Find L = lim,-_, 1, (log x)!~*. 


Solution. If the limit L > 0 exists, then log L = lim,_,;(1 — x) log log x is of the 
form 0 - oo. Hence, by |’Hdpital’s Rule, 


.  loglogx _ 1/@logx) _ —20-x) 
log L = lim ———— = lim = lim =0 
x>11/(1—x) x>11/( x)? x>1 1+logx 


Therefore, the limit exists by l’H6pital’s Rule and L = e® = 1. | 


EXERCISES 


4.4.0. Decide which of the following statements are true and which are false. 
Prove the true ones and provide counterexamples for the false ones. 


a) x/logx > Oasx > 0. 

b) Ifn EN, then sin(1/x)/x” > 0asx > oo. 

c) xl8* -5 Oasx > 04. 

d) If there is a 8 > O such that f’(x) > £ for all x € (0,00), then 
x*/f (x) > 00 aS x > 00. 


4.4.1. Let f(x) =cosx andn EN. 


a) Find the Taylor polynomial P , := Pp . 
b) Prove that if x € [—1, 1], then 


|cosx — Prn(x)| < 


(Qn +1)!" 


c) Find an n so large that P2, approximates cosx on [—1, 1] to seven 
decimal places. 
4.4.2. Let f(x) =logx andn EN. 


a) Find the Taylor polynomial P,, := Pf a 
b) Prove that if x € [1, 2], then 


1 
1 — P, < ——_. 
| log x n(x)| < AE 
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4.4.3. 


4.4.4. 


4.4.5. 


4.4.6 


Differentiability on R 


c) Find ann so large that P, approximates log x on [1, 2] to three decimal 
places. 


Prove that 
x2 x” 
l+x+—+---4+—<e" 
2! n! 
for every x > O and everyn EN. 
Prove that 
Pe) " Pe x4n-l a3 - ) x4ntl 
S88 < sin oe of 
ee ai. (4n — 1)! ecient " Gn+D! 


for everyn € Nand x € (0,7). 
Evaluate the following limits. 
Dae 
a) lim —~———_ 
x0 cos2(3x) — 1 
e* +sinx — 1 


b) lim 
x—0+ log 2* 


x 
c) lim (=) 
x>0\1—cosx 
d) lim (x? — 5x)* 
x—>0t 


: TX 
e) lim ——__—___ 
*9 gos (2x5) 


f) lim |1 wa 
) Tim, |log(x)| 


x3 -4-—Vx3 

g) lim 
X00 xe+1- 4x3 
h) lim xX+4—-—JVx4+3 


x00 /x +2—JSx+1 


. This exercise is used in Sections 5.4, 6.3, and elsewhere. Let a > 0 and 


recall that (x%)’ = ax®~! and (logx)/ = 1/x for all x > 0. 


a) Prove that logx < x® for x large. Prove that there exists a constant 
Cy such that logx < Cyx® for all x € [1, 00), Cy ~ coasa —> 04, 
and Cy > Oasa > ~. 

b) Obtain an analogue of part a) valid for e* and x° in place of log x and 


x 
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4.4.7|. This exercise is used in Sections 7.4 and 12.5. Assume that e* is differen- 
tiable on R with (e*)’ = e’. 


a) Show that the following function is differentiable on R with 


f'(0) =0: 


ent? x #0 
FO= 19 x=0. 


b) Do analogous statements hold for f(x) when n = 2, 3, ...? 


4.4.8. Suppose that n € N is odd and f™) exists on [a,b]. If f(a) = 
f(b) =0 for allk =0,1,...,n —1 and f(c) 4 0 for some c € (a,b), 
prove that there exist x1;,x2 € (a,b) such that f ™ (x1) is positive and 
f™ (x2) is negative. 


4.4.9. a) Prove that |6 + sin(S + r)| < 63/3! for allO0 <6 <1. 
b) Prove that if |x — | <6 < 1, then |x + sinx — 2| < 63/3!. 


4.4.10. Prove l’H6pital’s Rule for the case |B| = oo by first proving that 
2@(x)/f (x) > Owhen f(x)/g(x) > +00, asx > a. 

4.4.11. Suppose that f and g are differentiable on an open interval J and that 
a € Reither belongs to J or is an endpoint of 7. Suppose further that g 
and g’ are never zero on J \ {a} and that 


i f(x) 
im —— 


xa g(x) 


is of the form 0/0. If there is an M € R such that | f’(x)/g/(x)| < M for 
all x € I \ {a}, prove that | f(x)/g(x)| < M for all x € I \ {a}. 
Is this result true if the limit of f(x)/g(x) is of the form co/0o? 


4.5 INVERSE FUNCTION THEOREMS 


In this section, we explore the continuity and differentiability of inverse 
functions. 

Recall that f : X — Y has an inverse function f~! if and only if f is 1-1 
and onto (Theorem 1.30), in which case f~!(f(x)) = x for all x € X and 
f(f-'(y)) = y for all y € Y. Since (x, f(x)) = (f7'(y), y), this means that 
the graph of y = f~!(x) is a reflection of the graph of y = f(x) about the line 
y = x (see Figure 4.4). In particular, it is not difficult to imagine that f~! is as 
smooth as f. This is the subject of the next two results. 


4.32 Theorem. Let I be a nondegenerate interval and suppose that f : I > Ris 
1-1. If f is continuous on I, then J := f (1) is an interval, f is strictly monotone on 
I, and f~' is continuous and strictly monotone on J. 


Proof. Since f is 1-1 from I onto J, Theorem 1.30 implies that f~! exists and 
takes J onto J. 
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FIGURE 4.4 


To show that J is an interval, since J contains at least two points, so does J. 
Let c,d € J withc < d. By the definition of an interval, it suffices to prove 
that every yo € (c, d) belongs to J. Since f takes J onto J, there exist points 
a,b € I such that f(a) = c and f(b) = d. Since yo lies between f(a) and 
f(b), we can use the Intermediate Value Theorem to choose an x9 between a 
and b such that yo = f(xo). Since x9 € J and f takes J onto J, yo = f (xo) 
must belong to J, as required. 

Suppose that f is not strictly monotone on J. Then there exist points 
a,b,c € I such that a < c < b but f(c) does not lie between f(a) and 
f(b). Since f is 1-1, f(a) # f(b), so by symmetry we may suppose that 
f(a) < f(b). Since f(c) does not lie between f(a) and f(b), it follows that 
either f(c) < f(a) < f(b) or f(a) < f(b) < f(c). Hence by the Interme- 
diate Value Theorem, there is an x; € (a, b) such that either f(x,;) = f(a) or 
f(x1) = f(b). Since f is 1-1, we conclude that either x; = a or x; = b, both 
contradictions. Therefore, f is strictly monotone on J. 

By symmetry, suppose that f is strictly increasing on J. To prove that f~! is 
strictly increasing on J, suppose to the contrary that there exist y;, y2 € J such 
that y; < y2 but f-!(y1) = f-'G2). Then x; := f~'(y1) and x2 := f—!(y2) 
satisfy x; > x2 and x1,x2 € I. Since f is strictly increasing on /, it follows 
that y) = f(x1) > f(x2) = y2, a contradiction. Thus, f~! is strictly increasing 
on J. 

It remains to prove that f—! is continuous from the left and from the right 
at each yo € J. We will provide the details for continuity from the right. To 
this end, suppose that f—! is not continuous from the right at some yo € J; 
that is, that there exist y, € J such that y, > yo, yx > yo aSn > @, 
but that 


f'On) = a0 > f7' 0) (23) 


for some number ap. Since / is an interval and f~! takes J onto J, it follows 
that ag belongs to J and there is a bp € J such that ag = f—!(bo). Substituting 
this into (23), we see that f-'(n) = f7'(bo) > f7!(y0). Since f is strictly 
increasing, we conclude that y, > bo > yo; that is, y, cannot converge to yo, 
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a contradiction. A similar argument verifies that f—! is continuous from the 
left at each yg € J. Thus f~! is continuous on J. a 


Our final result addresses the differentiability of an inverse function. 


4.33 Theorem. [INVERSE FUNCTION THEOREM]. 
Let I be an open interval and f : I > R be 1-1 and continuous. If b = f (a) 
for some a € I and if f'(a) exists and is nonzero, then f—' is differentiable at b 


and (f~')'(b) = 1/f"(a). 


Proof. By Theorem 4.32, f is strictly monotone, say strictly increasing on /, 
and f~! exists and is both continuous and strictly increasing on the range 
f (1). Moreover, since a := f~'(b) € I and / is open, we can choose c,d € R 
such that a € (c,d) C I. 

Let Eo be the range of f on (c, d); that is, Eo = f((c, d)). By Theorem 4.32, 
Ey must be an interval. Since f is strictly increasing, it follows that Eg = 
(f(c), f(d)). Hence, we can choose 6 > 0 so small that 0 < |h| < 6 implies 
b+h€ Ep. In particular, f~!(b + h) is defined for all 0 < |h| < 6. 

Fix such anh and set x = f~!(b+h). Observe that f (x)— f(a) =b+h—b=h. 
Since f—! is continuous, x — a if and only if h — 0. Therefore, by direct 
substitution, we conclude that 


_ fO+h— fb) xa 
lim = lim = ; 
h—>0 h x>a f(x)— fla) f'i@ 


This theorem is usually presented in elementary calculus texts in a form more 
easily remembered: If y = f(x) andx = f—'(y), then 


dx 1 


dy dy/dx’ 


Notice that, by using this formula, we do not need to solve explicitly for f~! to 
be able to compute (f~!)’. 


4.34 EXAMPLE. 


lf fa" bat ee? x? ee 1, prove that f(x) exists at x = 6 and find a 
value for (f~!)’(6). 


Solution. Observe that f(1) = 6 and f’(x) > 0 for all x > 0. Thus f is strictly 
increasing on (0, oo), and hence 1-1 there. 

Let J = (0,2), a=1,andb =6. Then f(a) = band f’(a) = 15 £0. Hence, it 
follows from the Inverse Function Theorem that (f~!)’(6) = 1/f’(1) = 1/15. 
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EXERCISES 


4.5.0. 


4.5.1. 


4.5.2. 


4.5.3. 


4.5.4. 


4.5.5. 


Decide which of the following statements are true and which are 
false. Prove the true ones and provide counterexamples for the 
false ones. 


a) Suppose that J C Ris nonempty. If f : J > Ris 1-1 and continuous, 
then f is strictly monotone on J. 

b) Suppose that J is an open interval which contains 0 and that f : 
I +R is 1-1 and differentiable. If f and f’ are never zero on /, 
then the derivative of f~! has at least one root in f (J); that is, there 
is ana € J such that (f~!)/(a) =0. 

c) Suppose that f and g are 1-1 on R. If f and go f are continuous on 
R, then g is continuous on R. 

d) Suppose that / is an open interval and that a € J. Suppose further 
that f : 7 > Rand g: f(/) > Rare both 1-1 and continuous and 
that b := f(a). If f’(a) and g’(b) both exist and are nonzero, then 
(g o f)~!(x) is differentiable at x = g(b), and ((g 0 f)~!)/(g(b)) = 
(f'(a) - g'(b)) 1. 


Suppose that f and g are 1-1 and continuous on R. If f(2)=1, 
g(-HD=1, f’(2) = 3, and g’(-1) = —4, compute the following 
derivatives. 
a) (f-')') 


b) (gy) 
c) (f~!g7!y') 


Let f(x) =x? log x*, and assume that (log x)’ = 1/x for all x € (0, 00). 


a) Show that f~! exists and is differentiable on (1, 00). 
b) Compute (f~!)’(2e?). 


Using the Inverse Function Theorem, prove that (arcsin x)’ = 1//1 — x? 
for x € (—1, 1) and (arctan x)’ = 1/(1 + x”) for x € (—00, 00). 

Suppose that f’ exists and is continuous on a nonempty, open interval 
(a, b) with f’(x) 4 0 for all x € (a, D). 


a) Prove that f is 1-1 on (a, b) and takes (a, b) onto some open interval 
(c,d). 

b) Show that f~! € C!(c, d). 

c) Using the function f(x) = x3, show that b) is false if the assumption 
f'(x) £ 0 fails to hold for some x € (a, b). 

d) Sketch the graphs of y = tanx and y = arctan to see that c andd in 
part b) might be infinite. 


Suppose that a := limy—.o0(1 + 1/x)* exists and is greater than 1 (see 
Example 4.22). Assume that a* : R — (0, 00) is onto, continuous, 
strictly increasing, and satisfies a*a” = a**t and (a*)” = a*” for all 
x,y € R (see Exercise 3.3.11). Let L(x) denote the inverse function 
of a*. 


4.5.6. 


4.5.7. 


4.5.8. 


4.5.9. 


4.5.10. 


*4.5.11. 
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a) Prove that tL(1+1/t) > last > ow. 

b) Prove that (a” — 1)/h > lash > 0. 

c) Prove that a* is differentiable on R and (a*)! = a’ for all x ER. 
d) Prove that L’(x) = 1/x for all x > 0. 


[Note: a is the natural base e and L(x) is the natural logarithm log x.| 
Suppose that J is a nondegenerate interval, that f : J > Ris differen- 
tiable, and that f’(x) 4 0 for all x € J. 


a) Prove that f—! exists and is differentiable on f (J). 
b) Suppose further that J is a closed, bounded interval and that f’ is 
continuous. Prove that (f~!)’ is bounded on f (J). 


Suppose that f : [a,b] — [c, d] is differentiable and onto. If f’ is never 
zero on [a, b] and d —c > 2, prove that for every x € [c, d] there exist 
x1 € [a, b] and x € [c, d] such that | f’(x1)(f~!(x) — f7!(x2))| = 1. 
Suppose that f is differentiable on a closed, bounded interval [a, b]. 
If f[a,b] = [a,b] and f’ is never zero on [a, b], prove that for every 
x € [a, b] there exist x1, x2 € (a, b) such that 


fo) = flandf' Caf) — f'@) + f@. 


Let [a, b] be a closed, bounded, nondegenerate interval. Find all func- 
tions f which satisfy the following conditions for some fixeda > 0: f 
is continuous and 1-1 on [a, b], f/(x) € Oand f’(x) = a(f—!)/(f(@)) 
for all x € (a, D). 

Suppose that f is C ' on an interval (a, b). If f’(xo) 4 0 for some xo € 
(a, b), prove that there exist intervals J and J such that f is 1-1 from J 
onto J and f~! is continuously differentiable on J. 

Suppose that f is differentiable at every point in a closed, bounded 
interval [a, b]. Prove that if f’ is 1-1 on [a, b], then f’ is strictly mono- 
tone on [a, D]. 


CHAPTER 5 


Integrability on R 


5.1 THE RIEMANN INTEGRAL 


In this chapter we shall study integration of real functions. We begin our discus- 
sion by introducing the following terminology. 


5.1 Definition. 
Let a,b € Rwitha < b. 


i) A partition of the interval [a, b] is a set of points P = {xo, x1,...x,} such 
that 


a=x9 <x] <:+++<x,=b. 


ii) The norm of a partition P = {xg, x1, ...X,} is the number 


Pil = max |xj—xj-1l. 
l<j<n 


iii) A refinement of a partition P = {xo,x1,...X,} is a partition Q of [a, b] 
which satisfies Q > P. In this case we say that Q is finer than P. 


5.2 EXAMPLE, [THE DYADIC PARTITION]. 


Prove that for eachn EN, P, = {j/2" : j = 0,1,...,2"} is a partition of the 
interval [0, 1], and P,, is finer than P, when m > n. 


Proof. Fixn €N. Ifx; = j/2”, then 0 = x9 < xj < +++ < x9 = 1. Thus, 
P,, is a partition of [0, 1]. Let m > n and set p= m—vn. If0 < j < 2”, then 
J/2" = j2?/2™ and0 < j2? <2”. Thus P,, is finer than Py. a 


It is clear that by definition, if P and Q are partitions of [a, b], then P U Q is 
finer than both P and Q. (Note that finer does not rule out the possibility that 
PU Q = Q, which would be the case if Q were a refinement of P.) And if Q is 
a refinement of P, then ||Q]|| < || P||. We shall use these observations often. 

Let f be nonnegative on an interval [a, b]. You may recall that the integral of 
f over [a, b] (when this integral exists) is the area of the region bounded by the 
curves y = f(x), y =0, x =a, and x = b. This area, A, can be approximated 
by rectangles whose bases lie in [a, b] and whose heights approximate f (see 
Figure 5.1). If the tops of these rectangles lie above the curve y = f(x), the 
resulting approximation is larger than A. If the tops of these rectangles lie below 
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FIGURE 5.1 


the curve y = f(x), the resulting approximation is smaller than A. Hence, we 
make the following definition. 


5.3 Definition. 


Let a,b € Rwitha < b, let P = {xo, x1,...x,} be a partition of the interval 
[a,b], set Ax; := x; —x;-1 for j = 1,2,...,n,and suppose that f : [a,b] > R 
is bounded. 


i) The upper Riemann sum of f over P is the number 


UEP => IG) ap 


j=l 
where 


Mj(f) := sup f(xj-1,xj):= sup f(t). 


te[xj-1,%;] 


ii) The lower Riemann sum of f over P is the number 


LG, P) =} my(f) Ay, 


j=l 
where 


mj(f) := inf f([xj-1,x;]):= inf ,7 @. 


te[xj-1 Xj 


(Note: Since we assumed that f is bounded, the numbers M;(f) and mj(f) 
exist and are finite.) 


Some specific upper and lower Riemann sums can be evaluated with the help 
of the following elementary observation. 


146 Chapter 5 Integrability on R 


5.4 Remark. /fg : N — R, then 


YS (g(k + 1) — gk) = gt) — g(m) 


k=m 
foralln>minN. 


Proof. The proof is by induction on n. The formula holds for n = m. If it 
holds for some n — 1 > m, then 


n 


So (gk +) = g() = (gv) — gm) + (gin + DY - gM) = gn + 1) - gm). 
k=m 


a 

We shall refer to this algebraic identity by saying the sum telescopes to 
g(n + 1) — g(m). In particular, if P = {xo,x1,..., xn} is a partition of [a, b], 
the sum Al Ax; telescopes to x, — x9 = b—a. 

Before we define what it means for a function to be integrable, we make 
several elementary observations concerning upper and lower sums. 
5.5 Remark. /f f(x) = a is constant on [a, b], then 

U(f, P) = L(f, P) = ab —a) 

for all partitions P of a, b]. 


Proof. Since M;(f) = mj;(f) = a for all j, the sums U(f, P) and L(f, P) 
telescope to a(b — a). a 


5.6 Remark. L(f, P) < U(f, P) for all partitions P and all bounded functions f. 
Proof. By definition, mj;(f) < M;(f) for all j. | 


The next result shows that as the partitions get finer, the upper and lower 
Riemann sums get nearer each other. 


5.7 Remark. Jf P is any partition of [a, b] and Q is a refinement of P, then 


L(f, P) <= L(f,Q) s Uf, Q) < UF, P). 


Proof. Let P = {xo, x1,...xn} be a partition of [a, b]. Since Q is finer than P 
Q can be obtained from P in a finite number of steps by adding one point at 
a time. Hence it suffices to prove the inequalities above for the special case 
Q = {c}UP for some c € (a,b). Moreover, by symmetry and Remark 5.6, 
we need only show U(f, Q) < U(f, P). 
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We may suppose that c ¢ P. Hence, there is a unique index jo such that 
Xjg—-1 < ¢ < Xj). By definition, it is clear that 
Uf, =U GP) = 6 =a) Mag —c) — M Any, 
where 


M© = sup f ([xj—1.c), M® =sup f([e, xj}, and 
M = sup f ([Xjo-1, Xjo])- 


By the Monotone Property of Suprema, M and M“” are both less than or 
equal to M. Therefore, 


U(f, Q)—U(f, P) < Mc — xjp-1) + M(QXjy — €) — M Axi, = 0. a 


5.8 Remark. /f P and Q are any partitions of [a, b], then 


L(f, P) < UC, Q). 


Proof. Since P U Q is arefinement of P and Q, it follows from Remark 5.7 
that 


L(f, P)<L(f,PUQ) <U(f, PUQ) < U(f, Q) 


for any pair of partitions P, Q, whether Q is a refinement of P or not. a 


We now use the connection between area and integration to motivate the 
definition of integrable. Suppose that f(x) is nonnegative on [a, b] and that the 
region bounded by the curves y = f(x), y = 0, x = a, andx = bhasa 
well-defined area A. By Definition 5.3, every upper Riemann sum is an over- 
estimate of A, and every lower Riemann sum is an underestimate of A (see 
Figure 5.1). Since the estimates U(f, P) and L(f, P) should get nearer to A as 
P gets finer, the differences U(f, P) — L(f, P) should get smaller. [The shaded 
area in Figure 5.2 represents the difference U(f, P) — L(f, P) for a particular 
P.| This leads us to the following definition (see also Exercise 5.1.9). 


AY 


FIGURE 5.2 
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5.9 Definition. 


Let a,b € Rwitha < b. A function f : [a,b] > R is said to be (Riemann) 
integrable on [a, b] if and only if f is bounded on [a, b], and for every « > 0 
there is a partition P of [a, b] such that U(f, P) — L(f, P) < «. 


Notice that this definition makes sense whether or not f is nonnegative. 
The connection between nonnegative functions and area was only a convenient 
vehicle to motivate Definition 5.9. Also notice that, by Remark 5.6, U(f, P) — 
L(f, P) =|UCf, P)-L(f, P)| for all partitions P. Hence, U(f, P)—L(f, P) <« 
is equivalent to |U(f, P) — L(f, P)| <e. 

This section provides a good illustration of how mathematics works. The con- 
nection between area and integration leads directly to Definition 5.9. This defi- 
nition, however, is not easy to apply in concrete situations. Thus, we search for 
conditions which imply integrability and are easy to apply. In view of Figure 5.2, 
it seems reasonable that a function is integrable if its graph does not jump 
around too much (so that it can be covered by thinner and thinner rectan- 
gles). Since the graph of a continuous function does not jump at all, we are 
led to the following simple criterion that is sufficient (but not necessary) for 
integrability. 


5.10 Theorem. Suppose that a,b € Rwitha < b. If f is continuous on the 
interval [a, b], then f is integrable on [a, b]. 


Proof. Let « > 0. Since f is uniformly continuous on [a, b], choose 6 > 0 
such that 


; ; € 
|x —y| <5 implies |f(x)— f(y)| < —. (1) 
Let P = {xo, x1, ..., Xn} be any partition of [a, b] which satisfies || P|| < 6. Fix 


an index j and notice, by the Extreme Value Theorem, that there are points 
Xm and xy in [x;~1, x;] such that 


f@m)=mj(f) and f(x) = M;(f). 
Since ||P || < 5, we also have |x — x| < 6. Hence by (1), Mj(f) —mj(f) < 
é/(b — a). In particular, 
U(f, P) — Lf, P) = (Mj (f) — mj(f) Axj < >) Axj =e. 
j=l j=l 
(The last step comes from telescoping.) a 


Although the converse of Theorem 5.10 is false (see Example 5.12 and 
Exercises 5.1.3, 5.1.6, and 5.1.8), there is a close connection between integra- 
bility and continuity. Indeed, we shall see (Theorem 9.69) that a function 
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is integrable if and only if it has relatively few discontinuities. This princi- 
ple is illustrated by the following examples. The nonintegrable function in 
Example 5.11 is nowhere continuous (hence has many discontinuities) but the 
integrable function in Example 5.12 has only one discontinuity (hence has few 
discontinuities). 


5.11 EXAMPLE. 
The Dirichlet function 
1 xEQ 
f(x)= 0 x40 


is not Riemann integrable on [0, 1]. 


Proof. Clearly, f is bounded on [0, 1]. By Theorem 1.18 and Exercise 1.3.3 
(Density of Rationals and Irrationals), the supremum of f over any nonde- 
generate interval is 1, and the infimum of f over any nondegenerate interval 
is 0. Therefore, U(f, P) — L(f, P) = 1—0 = 1 for any partition P of the 
interval [0, 1]; that is, f is not integrable on [0, 1]. | 


5.12 EXAMPLE. 
The function 
0 O<x<1/2 
fa) = ee 
ft t2<2<1 
is integrable on [0, 1]. 
Proof. Let ¢ > 0. Choose 0 < x; < 0.5 < x2 < 1 such that x. — x, < «. 
The set 
Poe {0,244,403} 


is a partition of [0, 1]. Since mi(f) = 0 = Mi(f), m2(f) =0 < 1 = Mr(f), 
and m3(f) = 1 = M3(f), it is easy to see that U(f, P)-—L(f, P) = x2-x1 <e. 
Therefore, f is integrable on [0, 1]. a 


We have defined integrability, but not the value of the integral. We remedy 
this situation by using the Riemann sums U(f, P) and L(f, P) to define upper 
and lower integrals. 


5.13 Definition. 
Let a,b € Rwitha < b, and f : [a,b] — R be bounded. 


i) The upper integral of f on [a, b] is the number 


b 
wf f(x) dx :=inf{U(f, P) : P isa partition of [a, b]}. 
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5.13 Definition. (Continued) 
ii) The lower integral of f on [a, b] is the number 
b 
w | f(x) dx := sup{L(f, P) : P is a partition of [a, b]}. 
a 


iti) If the upper and lower integrals of f on [a,b] are equal, we define the 
integral of f on [a, b] to be the common value 


b b b 
[ roa=w| fla) dx = (Ly f f(x) dx. 


This defines integration over nondegenerate intervals. Motivated by the inter- 
pretation of integration as area, we define the integral of any bounded function 
f on [a, a] to be zero; that is, 


a f(x) dx :=0. 


Although a bounded function might not be integrable (see Example 5.11 
above), the following result shows that the upper and lower integrals of a 
bounded function always exist. 


5.14 Remark. /f f : [a,b] — R is bounded, then its upper and lower integrals 
exist and are finite, and satisfy 


b b 
w) | fis)dx = (uy [ (Ovex: 


Proof. By Remark 5.8, L(f, P) < U(f, Q) for all partitions P and Q of [a, b]. 
Taking the supremum of this inequality over all partitions P of [a, b], we have 


b 
w | f(x) dx < U(f, Q); 


that is, the lower integral exists and is finite. Taking the infimum of this last 
inequality over all partitions Q of [a, b], we conclude that the upper integral 
is also finite and greater than or equal to the lower integral. 


Suppose that f is bounded and nonnegative on [a, b]. Since the upper and 
lower sums of f approximate the “area” of the region bounded by the curves 
y= f(x), y = 0, x =a, and x = b, we guess that f is integrable if and only 
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if the upper and lower integrals of f are equal. The following result shows that 
this guess is true whether or not f is nonnegative. 


5.15 Theorem. Leta,b € Rwitha < b, and f : [a,b] — R be bounded. Then f 
is integrable on [a, b] if and only if 


b b 
wf foyas= uy [ fooae. 2) 
a a 
Proof. Suppose that f is integrable. Let ¢ > 0 and choose a partition P of 
[a, b] such that 
UF, P) = LG P). = = (3) 
By definition, (U) f e f(x)dx < U(f, P) and the opposite inequality holds 


for the lower integral and the lower sum L(f, P). Therefore, it follows from 
Remark 5.14 and (3) that 


b b 
=u) [ fis) dx —(L) [ f(x) dx 
<U(f, P)—L(f, P) <e. 


b b 
lw | fla) dx ~(L) | f(x) dx 


Since this is valid for all ¢ > 0, (2) holds as promised. 
Conversely, suppose that (2) holds. Let e > 0 and choose, by the Approxi- 
mation Property, partitions P; and P> of [a, b] such that 


b 
U(f. Pi) < wf foydx +é 
and 


b 
LP) S w | fo)dx —§. 


Set P = P; U Po. Since P is a refinement of both P; and Py, it follows from 
Remark 5.7, the choices of Pj and P2, and (2) that 


b A b : 
<u) | flxyax+5— cy | fa)dx +5 =. . 


Since the integral has been defined only on intervals [a, b], we have tacitly 
assumed that a < b. We shall use the convention 


a b 
/ fend =— f f(x) dx 
b a 
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to extend the integral to the case a > b. In particular, if f(x) is integrable 
and nonpositive on [a, b], then the area of the region bounded by the curves 
y= f(x), y=0, x =a, and x = bis given by is ft (x)dx. 

In the next section we shall use the machinery of upper and lower sums to 
prove several familiar theorems about the Riemann integral. We close this sec- 
tion with one more result which reinforces the connection between integration 
and area. 


5.16 Theorem. Jf f(x) =a is constant on [a, b], then 


b 
/ f(x) dx =a(b—a). 


Proof. By Theorem 5.10, f is integrable on [a,b]. Hence, it follows from 
Theorem 5.15 and Remark 5.5 that 


b b 
[ toa-w f() dx = inf U(f, P) = a — a). | 


EXERCISES 


5.1.0. 


5.1.1. 


Suppose that a < b < c. Decide which of the following statements are 
true and which are false. Prove the true ones and give counterexamples 
for the false ones. 


a) If f is Riemann integrable on [a, b], then f is continuous on [a, d]. 

b) If |f| is Riemann integrable on [a, b], then f is Riemann integrable 
on [a, b]. 

c) For all bounded functions f : [a,b] > R, 


b b b 
w) | foxyax sf fx)ax <u) [ f(x) dx. 


d) If f is continuous on [a, b) and on [b,c], then f is Riemann inte- 
grable on [a, c]. 


For each of the following, compute U(f, P), L(f, P), and fis ft (x)dx, 


where 
1 
P= fo. 5 12h 
2 


Find out whether the lower sum or the upper sum is a better approxi- 
mation to the integral. Graph f and explain why this is so. 


a) f(x) = 3x? 
b) f(x) = 10 — 3x? 
c) f(x) = tan(x/4) 
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5.1.2. a) Prove that for eachn € N, 


is a partition of [0, 1]. 
b) Prove that a bounded function f is integrable on [0, 1] if 


(*) Io:= lim L(f, Pah) = lim UCf, Pn), 
noo noo 


in which case fh f (x)dx equals Ip. 
c) For each of the following functions, use Exercise 1.4.4 to find for- 
mulas for the upper and lower sums of f on P,, and use them to 


compute the value of i f(x)dx. 


a) f(x) =x 
By fQ)=x 

JO O<x<1/2 
y Feo" (eset 


5.1.3. Let E = {1/n: n € N}. Prove that the function 


1 xeEE 
P= i otherwise 


is integrable on [0, 1]. What is the value of i f (x)dx? 


5.1.4|. This exercise is used in Section *14.2. Suppose that a < b and that 


f : [a,b] — Ris bounded. 


a) Prove that if f is continuous at x9 € [a, b] and f (xo) 4 0, then 
b 
w | | f(x)| dx > 0. 
a 


b) Show that if f is continuous on [a, b], then rig |f(x)|dx = 0 if and 
only if f(x) = 0 for all x € [a, b]. 

c) Does part b) hold if the absolute values are removed? If it does, 
prove it. If it does not, provide a counterexample. 


5.1.5. Suppose that a < b and that f : [a,b] > Ris continuous. Show that 


[ ferax =o 
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for all c € [a, b] if and only if f(x) = 0 for all x € [a, b]. (Compare with 
Exercise 5.1.4, and notice that f need not be nonnegative here.) 

5.1.6. Let f be integrable on [a, b] and E be a finite subset of [a, b]. Show that 
if g is a bounded function which satisfies g(x) = f(x) for all x € [a, b]\E, 
then g is integrable on [a, b] and 


b b 
[ swar=| f(x) dx. 


5.1.7 |. This exercise is used in Section 12.3. Let f, g be bounded on [a, b]. 
a) Prove that 


b b b 
(U) / (f(x) + g(x)) dx < (U) / re ene / eterdy 


and 


b b b 
w | (f(x) + g(x)) dx 2 w | f(x) ax+(L) [ 8(x) dx. 


b) Prove that 


b Cc b 
(U) if f(x) dx =(U) / f(x) dx +(U) / OEE 
and 
b Cc b 
(L) / pare) / err rns i Foye 


fora<c <b. 


5.1.8 |. This exercise is used in Sections *5.5, 6.2, and *7.5. 


a) If f is increasing on [a,b] and P = {xo,..., xn} is any partition of 
[a, b], prove that 


YS \(Mj(f) — mj(f)) Ax; < (Ff) — f@) IPI. 


j=l 


b) Prove that if f is monotone on [a, b], then f is integrable on [a, b]. 
[Note: By Theorem 4.19, f has at most countably many (i.e., relatively 
few) discontinuities on [a, b]. This has nothing to do with the proof of 
part b), but points out a general principle which will be discussed in 
Section 9.8.] 
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5.1.9. Let a < b and 0 <c < d be real numbers and f : [a, b] > [c, d]. If f is 
Riemann integrable on [a, b], prove that ./f is Riemann integrable on 
[a, b]. 
5.1.10. Let f be bounded on a nondegenerate interval [a, b]. Prove that f is 
integrable on [a, b] if and only if given e > 0 there is a partition P, of 
[a, b] such that 


PDP, implies |U(f, P)—L(f, P)| <e. 


5.2 RIEMANN SUMS 


There is another definition of the Riemann integral frequently found in elemen- 
tary calculus texts. 


5.17 Definition. 
Let f : [a,b] > R. 
i) A Riemann sum of f with respect to a partition P = {xo,...,x,} of [a, b] 
generated by samples f; € [x;-1, x;] is asum 
n 
SH PLES YFG Ary 


j=! 


ii) The Riemann sums of f are said to converge to I(f) as ||P|| > 0 if and 
only if given ¢ > 0 there is a partition P; of [a, b] such that 


P = {xo,..-,Xn} D Pe implies |S(f, P,tj) —1(f)| <e 


for all choices of t; € [xj-1, xj], j = 1,2,...,n. In this case we shall use 
the notation 


1) = jim SU, Pty) = yo De f (tj) Axj. 


The following result shows that this definition of the Riemann integral is the 
same as the one using upper and lower integrals. 


5.18 Theorem. Leta,b € R witha < b, and suppose that f : [a,b] > R. Then 
fis Riemann integrable on (a, b] if and only if 


(fy = yo 2 f(tj) Axj 


exists, in which case I(f) = i f (x)dx. 
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Proof. Suppose that f is integrable on [a, b] and that e > 0. By the Approxi- 
mation Property, there is a partition P, of [a, b] such that 


b b 
Lif Pe) > f f(x)dx—e and us. Pe) < | f(x) dx +e. (4) 


Let P = {xo, X1,.--,Xn} D Pe. Then (4) holds with P in place of P,. But 
mj(f) < f(tj)) < M;(f) for any choice of t; € [x;-1, x;]. Hence, 


b n b 

i f(x)dx—e<L(f,P)< S> f(t) Ax; < U(f, P) <| Rade es: 
a j=l a 

that is, —e < Eee f (tj) Axj — ie f(x)dx < e. We conclude that 


n b 
Sf) Ax; - | f(x) dx| <e 
jal a 


for all partitions P > P, and all choices of tj € [xj-1, xj], j =1,2,...,n. 
Conversely, suppose that the Riemann sums of f converge to /(f). Let 
€é > Oand choose a partition P = {xo, x1,..., Xn} of [a, b] such that 


Sf yj) Axj —1(f) 


j=l 


a (5) 


for all choices of t; € [xj-1,x;]. Since f is bounded on [a,b] (see 
Exercise 5.2.11), use the Approximation Property to choose t;, uj € [xj-1, x;] 
such that f(t;)—f(uj) > M;(f)—m;(f)—e/(3(b—a)). By (5) and telescoping, 
we have 


UCf, P)— L(f, P) = > (My(f) — mj(f)) Ax; 
j=l 


” E 
< (F(t) — f(uj)) Axj + oa > Ax; 


j=l 


<|>0 f(t) Axj - 1(f) 
j=l 


n 


” é 
+ 1A YS) Ax; + 3p =@ 2 At 


Therefore, f is integrable on [a, b]. at 
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The next two results show that Riemann integrals of complicated functions 
can be broken into simpler pieces. 


5.19 Theorem. [LINEAR PROPERTY]. 


If f, g are integrable on [a,b] anda € R, then f + g and af are integrable on 
[a, b]. In fact, 


b b b 
/ (f(x) + g(x) ax= f f(x) ax+ | 8(x) dx (6) 


a 


and 
b b 
i (af (x)) dx = a | f(x) dx. (7) 


Proof. Let « > 0 and choose P, such that for any partition P = {xo, x1,..., 
Xn} 2 P, of [a, b] and any choice of t; € [x;-1, x;], we have 


n b € 
Fay) ax;- | f(x) dx| < 5 
j=l = 

and 


Yew) ax [ g(x) dx =. 


By the Triangle nee 


n n b b 
Sofa) Axjy + D> g(t) Ax; -{ f(x) ax— [ g(x) dx| <¢ 
j=l j=l a a 


for any choice of t; € [x;~1,x;]. Hence, (6) follows directly from Theo- 
rem 5.18. 

To prove (7), we may suppose that a # 0. Choose P, such that if P = 
{xo,..., Xp} is finer than P,, then 


n b 
SF) ax - | f(x) dx] < a 
j=l : 


for any choice of t; € [xj—1, x;]. Multiplying this inequality by |a|, we obtain 


dere dxj-af f(x) dx < lal = | 
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for any choice of t; € [xj-1,x;]. We conclude by Theorem 5.18 that (7) 
holds. a 


5.20 Theorem. [ff is integrable on [a, b], then f is integrable on each subinterval 
[c, d] of [a, b]. Moreover, 


b c b 
i flxydx = f fesyax+ [ f(x) dx (8) 
for allc € (a, b). 


Proof. We may suppose that a < b. Let « > 0 and choose a partition P of 
[a, b] such that 


U(f, P) —L(f, P) <e. (9) 


Let P’ = P U{c} and P; = P’[a, c]. Since P, is a partition of [a, c] and P’ is 
a refinement of P, we have by (9) that 


OCR PDH LG Pp =O, PSDP SUG Pye Ler yes: 


Therefore, f is integrable on [a,c]. A similar argument proves that f is inte- 
grable on any subinterval [c, d] of [a, b]. 

To verify (8), suppose that P is any partition of [a,b]. Let Py = PU 
{c}, Py = PoN[a, c], and Pp = PoN[c, b]. Then Py = P, U P2 and by definition 


U(f, P) = Uf, Po) =U(F, Pi) + UG, Po) 


c b c b 
> wy | f(x) dx + (U) | fax = f feax+ [ f(x) dx. 


(This last equality follows from the fact that f is integrable on both [a, c] and 
[c, b].) Taking the infimum of 


Cc b 
UC, py> f fsyax+ [ Fx) dx 


over all partitions P of [a, b], we obtain 


b b Cc b 
i, fx)ax =u) [ fear f fooyax+ [ F(x) de. 


A similar argument using lower integrals shows that 


b c b 
[ soars | fesyax+ [ f (x) dx. | 
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Using the conventions 


b a a 
i f(x) dx = -| f(x) dx and / f(x) dx =0, 
a b a 


it is easy to see that (8) holds whether or not c lies between a and b, provided f 
is integrable on the union of these intervals (see Exercise 5.2.4). 


5.21 Theorem. [COMPARISON THEOREM FOR INTEGRALS]. 
If f, g are integrable on [a, b] and f(x) < g(x) forall x € [a, b], then 


b b 
[ soars | pO te 


In particular, ifm < f(x) < M for x € [a, b], then 


b 
m(b — a) <|/ f(x) dx < M(b— a). 


Proof. Let P be a partition of [a, b]. By hypothesis, M;(f) < Mj;(g) whence 
U(f, P) < U(g, P). It follows that 


b b 
i etre wf Aa SUED 


for all partitions P of [a,b]. Taking the infimum of this inequality over all 
partitions P of [a, b], we obtain 


b b 
[ soars | g(x) dx. 


Ifm < f(x) < M, then (by what we just proved and by Theorem 5.16) 


b b b 
mb —a) = [ madx = | fas s f M dx = M(b—a). a 


We shall use the following result nearly every time we need to estimate an 
integral. 


5.22 Theorem. /f f is (Riemann) integrable on [a,b], then | f| is integrable on 


[a, b] and 
b 
/ f(x) dx 


Proof. Let P = {xo, x1, ..., Xn} be a partition of [a, b]. We claim that 


Mf) —m (Ff) < Mi(f) -—m,(f) (10) 


b 
<|/ If ()| de. 
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holds for j = 1, 2,...,n. Indeed, let x, y € [xj-1, xj]. If f(x), f(y) have the 
same sign, say both are nonnegative, then 


If) — IFO) = FO) — FO) s Mi(~) — mj (Pf). 


If f(x), f(y) have opposite signs, say f(x) => 0 > f(y), then m;(f) < 0 
and, hence, 


If@)| - IFO = F@)+ FO) = Mj(f) +0 5 Mj(f) —m,(f). 


Thus in either case, | f(x)| < Mj(f) —mj(f) + |f(y)|. Taking the supremum 
of this last inequality for x € [x;—1, x;] and then the infimum as y € [x;-1, x;], 
we see that (10) holds, as promised. 

Let e > Oand choose a partition P of [a, b] such that U(f, P)—L(f, P) < «. 
Since (10) implies U(|f|, P) — L(|f|, P) < UCf, P) — L(f, P), it follows that 


U(Ifl,P) -—LUFfl, P) <e. 


Thus | f| is integrable on [a, b]. Since —| f(x)| < f(x) < |f(x)| holds for any 
x € [a, b], we conclude by Theorem 5.21 that 


b b b 
-| (Pea ae < | fa) dx < | i@lae. a 


By Theorem 5.19, the sum of integrable functions is integrable. What about 
the product? 


5.23 Corollary. If f and g are (Riemann) integrable on [a, b], then so is fg. 


Proof. Suppose for a moment that the square of any integrable function is 
integrable. Then, by hypothesis, ree ae and (f + g) are integrable on 
[a, b]. Since 
(f+ —-f?-8 
fg= 5 ; 
it follows from Theorem 5.19 that fg is integrable on [a, b]. 


It remains to prove that f? is integrable on [a, b]. Let P be a partition of 
[a, b]. Since M;(f?) = (M,(|f|))? and m;(f?) = (mj (| f|))”, it is clear that 


M,(f7) —m;(f*) = (Mj FD)* — (mf)? 
= (Mj (IF) +m UF DMF) — mj (FD) 
< 2M(M(\f |) —mj(f1)), 


where M = sup|f|([a, b]); that is, | f(x)| < M for all x € [a, b]. Multiplying 
the displayed inequality by Ax; and summing over j = 1, 2,...,n, we have 


U(f?, P) — L(f*, P) < 2MU(|f|, P) — Ld fl. P)). 
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Hence, it follows from Theorem 5.22 that f? is integrable on [a, b]. a 


We close this section with two integral analogues of the Mean Value 
Theorem. 


5.24 Theorem. [FIRST MEAN VALUE THEOREM FOR INTEGRALS]. 
Suppose that f and g are integrable on [a, b] with g(x) > 0 forall x ¢€ [a, b]. If 


m=inf f[a,b] and M =sup fla, b], 
then there is anumber c € [m, M] such that 


b b 
/ fWe@) dx =e i pede 


In particular, if f is continuous on [a, b], then there is an xy € [a,b] which 
satisfies 


b b 
i) f@)g(x) dx = Fevo) f g(x) dx. 


Proof. Since g => 0 on [a, b], Theorem 5.21 implies 
b b b 
m | g(x) dx </ f(x)g(x) dx em | g(x) dx. 
a a a 


If i g(x)dx = 0, then fe f(x)g(x)dx = 0 and there is nothing to prove. 
Otherwise, set 


ow da FRC) dx 


fi ia g(x) dx 
and note that c € [m, M]. If f is continuous, then (by the Intermediate Value 
Theorem) we can choose xo € [a, b] such that f (x0) =c. a 


Before we state the Second Mean Value Theorem, we introduce an idea that 
will be used in the next section to prove the Fundamental Theorem of Calculus. 
If f is integrable on [a, b], then f can be used to define a new function 


F(x) i f(t) dt, x € [a, b]. 


5.25 EXAMPLE. 
Find F(x) = [> f(t) dt if 
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Solution. By Theorem 5.16, 


ry=frou= |" x20 
0 —Xx x< 


Hence, F(x) = |x|. | 


Notice in Example 5.25 that the integral F of f is continuous even though f 
itself is not. The following result shows that this is a general principle. 


5.26 Theorem. Jf f is (Riemann) integrable on [a,b], then F(x) = ie f@) dt 
exists and is continuous on [a, b]. 


Proof. By Theorem 5.20, F(x) exists for all x € [a,b]. To prove that F is 
continuous on [a, b], it suffices to show that F(x+) = F(x) for all x € [a, b) 
and F(x—) = F(x) for all x € (a,b). Fix x9 € [a,b). By definition, f is 
bounded on [a, b]. Thus, choose M € R such that | f(4)| < M for allt € [a, b]. 
Let e > Oand set 6 = ¢/M. If 0 < x — x9 < 6, then by Theorem 5.22, 


x x 
|F(x) — F(xo)| = / f(t) dt < | |f(t)| dt < M|x —xo| <e. 
x0 x0 
Hence, F(xo+) = F(xo). A similar argument shows that F(x9—) = F (xo) for 
all xo € (a, b]. |_| 


5.27 Theorem. [SECOND MEAN VALUE THEOREM FOR INTEGRALS]. 
Suppose that f, g are integrable on [a, b], that g is nonnegative on [a, b], and that 
m, M are real numbers which satisfy m < inf f ({a, b]) and M > sup f ({a, b)). 
Then there is anc € [a, b| such that 


b c b 
/ Flx)g@) dx =m f (x) dx +m [ g(x) dx. 


In particular, if f is also nonnegative on [a, b], then there is anc € [a, b] which 
Satisfies 


b b 
/ fodg(x) dx =M / OE 


Proof. The second statement follows from the first since we may use m = 0 
when f > 0. To prove the first statement, set 


x b 
Fo) =m | (drm f g(t) dt 


for x € [a,b], and observe by Theorem 5.26 that F is continuous on [a, b]. 
Since g is nonnegative, we also have mg(t) < f(t)g(t) < Meg(t) for allt € 
[a, b]. Hence, it follows from the Comparison Theorem (Theorem 5.21) that 
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b b b 
ren g(t) dt < / F(de(t) dt <M / pide = FU). 


Since F is continuous, we conclude by the Intermediate Value Theorem that 
there is anc € [a, b] such that 


b 
F(c)= / f@g(t) dt. 


When g(x) = 1 and f(x) > 0, these mean value theorems have simple geo- 
metric interpretations. Indeed, let A represent the area bounded by the curves 
y= f(x), y =0, x =a, and x = b. By the First Mean Value Theorem, there 
is ac € [m, M] such that the area of the rectangle of height c and base b — a 
equals A (see Figure 5.3). And by the Second Mean Value Theorem, if M is the 
maximum value of f on [a, b], then there is an c € [a, b] such that the area of 
the rectangle of height M and base b — c equals A (see Figure 5.4). 


AY 


y =f(x) 


#Y 


FIGURE 5.3 
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EXERCISES 


5.2.0. 


5.2.1. 


5.2.2. 


5.2.3. 


Suppose that a < b. Decide which of the following statements are true 
and which are false. Prove the true ones and give counterexamples for 
the false ones. 


a) If f and g are Riemann integrable on [a, b], then f — g is Riemann 
integrable on [a, b]. 

b) If f is Riemann integrable on [a, b] and P is any polynomial on R, 
then P o f is Riemann integrable on [a, b]. 

c) If f and g are nonnegative real functions on [a, b], with f continuous 
and g Riemann integrable on [a, b], then there exist x9, x1 € [a, b] 
such that 

b 


b 
/ f(x)g(x) dx = f(xo) | g(x) dx. 


x 
d) If f and g are Riemann integrable on [a, b] and f is continuous, then 
there is an xo € [a, b] such that 


b b 
i f(x)g(x) dx = reso) | g(x) dx. 


Using the connection between integrals and area, evaluate each of the 
following integrals. 


4 
a) / |x —2| dx 
=I 


4 
b) f |x — 2| + |x| dx 
-1 
3 
2 | V9—x? dx 
0 
5 
a | (44+V5+4x — x?) dx 
-1 


a) Suppose that a < bandn € Nis even. If f is continuous on [a, b] and 


p? f(x)x"dx = 0, prove that f(x) = 0 for at least one x € [a, b]. 
b) Show that part a) might not be true if n is odd. 
c) Prove that part a) does hold for odd n when a + b 4 0. 


Use Taylor polynomials with three or four nonzero terms to verify the 
following inequalities. 


1 
a) 0.23 <|/ sin(x*) dx < 0.2417 
0 


(The value of this integral is approximately 0.2338452.) 


5.2.4. 


5.2.5. 


5.2.6. 


5.2.7. 


5.2.8. 
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1 


b) 1.338 <|/ e* dx < 1.4514 
0 


(The value of this integral is approximately 1.3419044.) 


Suppose that f : [0, co) — [0, 0) is integrable on every closed interval 
[a, b] c [0, ov). If 


F(x) := ik e-” f(y) dy, x €[0, 00), 


then there is a function g : [0,0o©) — [0,0o) such that F(x) = 


See) f(y) dy for all x € [0, 00). 


Prove that if f is integrable on [0, 1] and 6 > 0, then 


noo 


1/n8 
lim nf f(x) dx =0 
0 


for alla < p. 
a) Suppose that g, > 0 is a sequence of integrable functions which 
satisfies 


b 
lim i &n(x) dx = 0. 
n—->Co a 
Show that if f : [a, b] — R is integrable on [a, b], then 
b 
lim / F(x) gn(x) dx = 0. 
n->Co a 
b) Prove that if f is integrable on [0, 1], then 
1 
lim / x" f(x) dx =0. 
n> Co 0 


Suppose that f is integrable on [a, b], that x9 = a, and that x, is a 
sequence of numbers in [a, b] such that x, + b asin — ov. Prove that 


b fs ips e 
/ f(x) dx = lim >| ‘ ' FO) dx. 
a caer’ Xp 


Let f be continuous on a closed, nondegenerate interval [a, b] and set 


M= sup |f(x)I. 
x€[a,b] 
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a) Prove that if M > 0 and p > 0, then for every « > O there is a 
nondegenerate interval J Cc [a, b] such that 


b 
(M —e)?IT| </ Lf GI? dx < M?(b —a). 


b) Prove that 


b !/p 
im (/ | f (x)? ax) =M. 


5.2.9. Let f : [a,b] ~ R, a = x9 < x1 < +--+ < xX, = b, and suppose that 
Ff (xg+) exists and is finite for k = 0,1,...,n2—1 and f(x,—) exists and is 
finite fork = 1,...,n. Show that if f is continuous on each subinterval 


(x~-1, Xx), then f is integrable on [a, b] and 


b n XE 
i fo\de= f(x) dx. 


k=1 Xk-1 


5.2.10. Prove that if f and g are integrable on [a, b], then so are f V g and f Ag 


(see Exercise 3.1.8). 


5.2.11. Suppose that f : [a,b] > R. 


a) If f is not bounded above on [a, b], then given any partition P of 
[a,b] and M > 0, there exist t; € [xj~1, x;] such that S(f, P,t;) > M. 

b) If the Riemann sums of f converge to a finite number /(f), as || P|| 
— 0, then f is bounded on [a, d]. 


5.3. THE FUNDAMENTAL THEOREM OF CALCULUS 


Let f be integrable on [a,b] and F(x) = is f@)dt. By Theorem 5.26, F is 
continuous on [a,b]. The next result shows that if f is continuous, then F is 
continuously differentiable. Thus “indefinite integration” improves the behav- 


ior of 


the function. 


5.28 Theorem. [FUNDAMENTAL THEOREM OF CALCULUS]. 


Let 
i) 


ii) 


La, b] be nondegenerate and suppose that f : [a,b] > R. 
If f is continuous on (a, b] and F(x) = he f(t) dt, then F € C'![a, b] and 


d ft 
< f(t) dt = Fe) = fs) 
X Ja 


for each x € [a, b]. 
If f is differentiable on (a, b| and f' is integrable on [a, b], then 


/ f'@ dt = f(x) -— f@ 


for each x € [a, b]. 
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Proof. i) For x € [a,b], set F(x) = f f@)dt. By symmetry, it suffices to 
show that if f(xo+) = f (xo) for some xo € [a, b), then 


lim EGO em) BO) = f (xo) (11) 


h>0+ h 


(see Definition 4.6). Let ¢ > 0 and choose a 6 > O such that x9 < t < x9 +6 
implies | f(t) — f(xo)| < ¢. Hx 0 <h < 6. Notice that by Theorem 5.20, 


xoth 


Foo+h) — Fo) = [ f(t) dt 


x0 


and that by Theorem 5.16, 


1 xoth 
f (xo) = al f (xo) dt. 
Xo 


Therefore, 


F h)—F peor’ 
od) SEM) it (f(t) — fo) dt. 


h fo) =F 


0 


Since 0 < h < 6, it follows from Theorem 5.22 and the choice of 6 that 


F h) — F 
| (xo + h) — Fo) f(x) 


1 xo+h 
h <;/ If() — f@o)l dt <e. 


0 


This verifies (11) and the proof of part i) is complete. 
ii) We may suppose that x = b. Let « > 0. Since f’ is integrable, choose a 
partition P = {xo, x1,...,X»} of [a, b] such that 


<E 


n b 
Dra as- [rod 
j=l 7 


for any choice of points t; € [xj-1,x;]. Use the Mean Value Theorem to 
choose points t; € [xj-1,x;] such that f(x;) — f(xj-1) = f/(tj) Ax;. It 
follows by telescoping that 


b 
yo — f(a) -| f'() dt 


n b 
= Levey - fe)» - | f'(@) dt] <e. 
j=l 7 
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5.29 Remark. The hypotheses of the Fundamental Theorem of Calculus cannot 
be relaxed. 


Proof. i) Define f on [—1, 1] by 


—l x <0 


then f is integrable on [—1, 1], but F(x) := dees f(x)dx = |x| — 1 is not 
differentiable at x = 0. 

ii) Define f on [0, 1] by f(x) := x? sin(1/x*) when x 4 0 and f(0) = 0. 
Then f is differentiable on [0, 1], but 


; : 1 2 1 
f («) = 2x sin - COS ; x £0, 
(32) -2 (3) 


is not even bounded on (0, 1], much less integrable on [0, 1]. a 


By the Fundamental Theorem of Calculus, integration is the inverse of differ- 
entiation in the following sense. If f’ is integrable, then 


b 
[ roars sol} = 10 - fo. 


In particular, 


b atl 
x b 
/ aS | 
a 


for each a > 0, and for each a < 0, provided a 4 —1 and [a, b] is a subset 
of (0, 00) (see Exercises 4.2.3 and 5.3.7). This result is sometimes called the 
Power Rule. 

These observations can be used to evaluate many integrals. 


5.30 EXAMPLES. 
i) Find fj Gx — 2)? dx. 
ii) Find (7/"(1 + sinx) dx. 


Solution. i) Since (3x — 2)* = 9x* — 12x +4, we have by the Power Rule that 


1 
i (3x — 2)? dx = 3x? — 6x7 +4x = 1. 
0 


ii) Since (cos x)’ = —sinx, we have by the Fundamental Theorem of Calcu- 
lus that 
m/2 
/ (1+ sinx) dx =x —cosx [= 5 +1. z 
0 
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Combining the Product Rule and the Fundamental Theorem of Calculus, we 
have another tool for evaluating integrals. 


5.31 Theorem. [INTEGRATION BY PARTS]. 
Suppose that f, g are differentiable on (a, b] with f', g' integrable on [a, b]. Then 


b b 
/ fxg (x) dx = f(b)g(b) — f (aga) -| f(x)g! (x) dx. 


Proof. By the Product Rule, (f(x)g(x))’ = f’(x)g(x) + f(x)g/(x) for x € 
[a, b]. Since f, g are continuous on [a, b] and f’, g’ are integrable on [a, d], 
it follows that (fg)/ is a sum of products of integrable functions and, hence, 
integrable on [a, b]. Thus, by the Fundamental Theorem of Calculus, 


b b 
Fidyg(b) ~ flargta) = | f'(x)g(x) ax [ f (x)g' (x) dx. a 


This rule is sometimes abbreviated as 


[udv=w- f van. 


where it is understood that if w = h(x) for some differentiable function h, then 
the Leibnizian differential dw is defined by dw = h'(x) dx. 

Integration by parts can be used to reduce the exponent n on an expression 
of the form (ax + b)” f(x) when f is integrable. 


5.32 EXAMPLE. 


Find Te x sinx dx. 


Solution. Let u = x and dv = sinx dx. Then du = dx and v = —cosx. Hence, 
by parts, 


m/2 x/2 m/2 x/2 
i x sinx dx = —x cosx lo -| (—cosx) dx = sinx iF — ely a 
0 0 


Integration by parts is also very effective on integrals involving products of 
polynomials and logarithms. 
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5.33 EXAMPLE. 
Find f; log x dx. 


Solution. Let u = logx and dv = dx. Then du = dx/x and v = x. Hence, by 
parts, 


3 3 
/ logx dx = xlogs |} ~ f dx = 3log3 —2. a 
1 1 


Complicated problems can frequently be reduced to simpler ones by changing 
variables. The following result shows how to change variables in a Riemann 
integral on R. 


5.34 Theorem. [CHANGE OF VARIABLES]. 
Let ¢ be continuously differentiable on a closed, nondegenerate interval 


La, b). If 
¢’ is nonzero on [a, b], (12) 
and if 
f is integrable on [c,d] := la, b], (13) 
then f og + |¢'| is integrable on [a, b], and 


d b 
/ ft) a= | F(G(x)) + 16’) dx. (14) 


STRATEGY: By the Mean Value Theorem, hypothesis (12) implies that ¢ is 1-1 
on [a, b]. Hence by Theorem 4.32, ¢ is strictly monotone on [a, b] and [c, d] := 
o[a, b] is a closed interval. 

Suppose that ¢ is strictly increasing on [a, b]; that is, |¢’| = ¢’ and [c,d] = 
[¢(a), 6(b)]. By Theorem 4.32, @! is increasing on [c,d]. Thus if P = {io, 
t},...,f} 18 a partition of [c,d] and xj = o-\(t;), then P := {xo,x1,...,Xn;}isa 
partition of [a, b]. A Riemann sum of the right side of (14) looks like 


S(fob-|d'l, Posi) = >> F@(s{))I6'(j)| Ary. (15) 


j=! 


On the other hand, a typical term of a Riemann sum of the left side of (14) 
looks like 


f(uj) Axj = f(uj) (b(t) — b(tj-1)). 


Since ¢’, hence ¢, is continuous, we can use the Intermediate Value Theorem to 
choose s; € [x;-1,x;] such that u; = ¢(s;), and the Mean Value Theorem to 
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choose cj € [xj-1, xj] such that @(x;) — @(xj-1) = (cj) Ax;. It follows that a 
Riemann sum the left side of (14) looks like 


S(f, P,uj) = D> £G(s))) (C7) Ax;. 


j=l 


The only difference between this last sum and (15) is that s; has been replaced 
by c;. Since c; and s; both belong to the interval [x ;_1, x;] and ¢’ is continuous, 
making this replacement should not change S much if the norm of P is small 
enough. Hence, a Riemann sum of the left side of (14) is approximately equal 
to a Riemann sum of the right side of (14). This means the integrals in (14) 
should be equal. Here are the details. 

Case 1. Suppose that ¢ is strictly increasing on [a, b]. Let e > 0. Since f is 
bounded, choose M ¢€ (0, 00) such that | f(x)| < M for all x € [c, d]. Since ¢’ is 
uniformly continuous on [a, b], choose 6 > 0 such that 


/ oe / : é : 
0) -9' CDI < AGH’ 


that is, 


IF(O(s))(O'(s;) — H'(Cj))I < (16) 


(= ee 
2(b — a) 


for all s;,c; € [a, b] with |s; —cj| <6. 

Next, use the Inverse Function Theorem to verify that ¢~! is continuously 
differentiable on [c,d]. Thus there is an 7 > O such that if s,c € [c,d] and 
ls —c| < n, then |@~!(s) —@7!(0)| < 8. 

Finally, since f is integrable on [c, d] = [¢(a), (b)], choose a partition P = 
{to, t1, ...t} of [c, d] such that || P|| < » and 


< 5 (17) 


(bd) 
o@ 


holds for any choice of uj; € [tj-1, tj]. 

Set xj = o \(t;) and observe (by the choice of 7) that P:= {xo,..-,X%,}isa 
partition of [a, b] which satisfies \|P || < 6. 

Let s; € [xj-1, xj], set uj = (s;), and apply the Mean Value Theorem to 
choose cj € [xj—-1,x;] such that 6(x;) — @(x4j-1) = $'(c;) Ax;. Then, by the 
choices of cj, uj, and t;, we have uj € [t;-1, tj] and 


Ff @(s;))6' (cj) Axy = f(us\ OG) — 6@j-1)) = FUG —t;-1). 
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Hence, it follows from (16) and (17) that 


" $(b) 
SF (s)))$'(s)) Axj [ fod 


j=l 


< y FP (sp) G'(Sj) — O'(cj)) Ax; 


j=l 


+ 


= o(b) 
Y feng -yv- f f(t) dt 
j=l $(a) 


€ é€ 
a Ag ee 
* 2b=a) & ae as 


We obtained this estimate for the fixed partition P of [a, b], but the same steps 
also verify this estimate for any partition finer than P. We conclude by Theo- 
rem 5.18 that (f o @) - |¢’| is integrable on [a, b] and (14) holds. 

Case 2. ¢ is strictly decreasing on [a, b]. Repeat the proof in case 1. The only 
changes are P = {@~!(xp),...,@7!(xo)} and |¢’| = —#’. Thus the Mean Value 
Theorem implies that 


(xj—-1) — b(xj) = O' (cj) (Xj-1 — xj) = I'(Cj)| AX;. 


Estimating the Riemann sums as above, we again conclude that 


d b 
[toa | FP (x))  16"(O)| at. a 


The proof of Theorem 5.34 also establishes the following more familiar form 
of the Change of Variables Formula: If ¢ is C ' on [a, b], if d’ is never zero on 
[a, b], and if f is integrable on ¢[a, b], then 


g(b) b 
) f(t) dt =) Ff ((x)) p(x) dx. 
¢(a) a 

The difficult part of Theorem 5.34 was verifying that f o @ - |¢’| is integrable on 
[a, b] when f is integrable on [c, d]. If we assume that f is continuous, the proof 
is a lot easier. 


5.35 Theorem. [CHANGE OF VARIABLES FOR CONTINUOUS 
INTEGRANDS]. 

If ¢ is continuously differentiable on a closed, nondegenerate interval (a, b] and 
fis continuous on ¢$((a, b]), then 


o(b) b 
ys f(t) dt = SG (x))b' (x) dx. 
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Proof. Set 


G(x) ah fot) dt) dt, x €[a,b], and 


Fu) = f(t)dt, ue (a,b), 
$(a) 


and observe that if m is the infimum of ¢([a, b]), then F(u) = iis f(t)dt -— 


ie a (a) f@adt. It follows from the Fundamental Theorem of Calculus that 
G'(x) = f(b(x))¢'(x) and F’(u) = f(u). Hence, by the Chain Rule, 


d 
rs (G(x) — F@())) =0 
Xx 


for all x € [a, b]. It follows from Theorem 4.17ii that G(x) — F(@(x)) is con- 
stant on [a, b]. Evaluation at x = a shows that this constant is zero. Thus 
G(x) = F(¢(a)) for all x € [a, 5], in particular, when x = b. | 


These Change of Variables Formulas can be remembered as a substitution if 
we use the Leibnizian differentials introduced above: u = ¢(x) implies du = 
g' (x)dx. 

Besides the usual applications to finding exact values for integrals of com- 
positions of functions, the Change of Variables Formula can also be used to 
estimate the value of an integral. Since energy, power, force, and many other 
physical quantities can be computed by integration, this technique has practical 
implications. For example, it sometimes allows one to use data from a particu- 
lar prototype to estimate what would happen if the prototype were redesigned, 
without going to the expense of building another prototype. 


5.36 EXAMPLE. 


Suppose that f is an unknown function which is nonnegative and continuous on 


[2, 5]. If data are collected that can be interpreted as ies f(x)dx = 3, find an 
upper bound for the integral 


2 
I a) f(a? +1) dx. 
1 


Solution. Let u = x* +1. Then du = 2x dx. Unlike textbook-style problems, 
we do not have a du term already in J. However, since x € [1,2] implies x > 1, 
and since f is nonnegative, it is clear that f (x?-+1) < 2xf(x*+1)/2. Therefore, 


3 
5" 


2 1 2 1 5 
bef for + idx sf 2xfla? + dx = 5 f f(u) du = a 
1 2 1 2 2 
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EXERCISES 


5.3.0. 


5.3.1. 


Suppose that a < b. Decide which of the following statements are true 

and which are false. Prove the true ones and give counterexamples for 

the false ones. 

a) If f is continuous and nonnegative on [a, b] and g : [a, b] — [a, b] is 
differentiable and increasing on [a, b], then 


g(x) 
F(x) =i f(t) dt 


is increasing on [a, b]. 
b) If f and g are differentiable on [a, b], if f’ and g’ are Riemann inte- 
grable on [a, b], and if f(a) = 0 but g is never zero on [a, b], then 


ey is fy * f(t)g’(t) 
f «=| oo (42) BOT Tessa te 


for all x € [a, b]. 
c) If f and g are differentiable on [a, b], and if f’ and g’ are Riemann 
integrable on [a, b], then 


b b 
i f'(x)g(x) dx +f f(x)g'(x) dx =0 


if and only if f(a)g(a) = f(b)g(d). 
d) If f and g are continuously differentiable on [a, b], and if h is contin- 
uous on [a, b], then 


g(f(b)) b : ; 

7. h(x) dx =) h(g(fa) Ns (fay f @) dx. 
g(f(a)) a 

If f : R > R is continuous, find F’(x) for each of the following 


functions. 


1 
a) Fo) = | f(t) dt 


3x4 
b) F@)= fat 


x 


c) Fa) = | tf (t) dt 
0 


x 


d) F(x) = f(xt+nhdt 


—-x 
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5.3.2. Suppose that f is nonnegative and continuous on [1,2] and that 


5.3.3. 


5.3.4. 


fe x* f (x)dx = 2k* +3 fork = 0,1,2. Prove that each of the follow- 
ing statements is correct. 


9 
a | f (vx) dx < 18 


: 1 3 
» Pianta) 8-3 
0 
| x’ f(l—x)dx =4 
—2 


Suppose that f is integrable on [0.5, 2] and that 


1 2 
/ fo) dx = | x* f(x) dx +2k* =3 4k 
0.5 1 


for k = 0,1,2. Compute the exact values of each of the following 
integrals. 


1 
a) / a £41) dx 
0 


V3/2- 43 
b) i Fai (v1 x?) dx 


Suppose that f and g are differentiable on [0, e] and that f’ and g’ are 
integrable on [0, e]. 


a) If fy f(x)/x dx < f(e), prove that 
/ f'(x) log x dx > 0. 
1 
b) If f() = fd) =0, prove that 
1 
i e*(f @) + f'@)) dx =0. 
c) If0 < {f(), g(0)} N{f(e), g(e)}, prove that 


[ fx)g! (x) dx = -{ g(x) f'(x) dx. 
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5.3.5. Use the First Mean Value Theorem for Integrals to prove the following 


version of the Mean Value Theorem for Derivatives. If f € C'[a, b], 
then there is an xo € [a, b] such that 


f(b) — f(a) = (b — a) f' (x0). 
5.3.6. If f is continuous on [a, b] and there exist numbers a # 6 such that 
¢c b 
af fiydx +B | f(x) dx =0 


holds for all c € (a, b), prove that f(x) = 0 for all x € [a, b]. 
5.3.7 |. This exercise is used in Sections 5.4 and 6.1. Define L : (0, co) > R by 


* dt 


1 


L@) = 


a) Prove that L is differentiable and strictly increasing on (0, oo), with 
Ea) =1/x and £0) = 0. 

b) Prove that L(x) — co as x > co and L(x) — —oo asx > 0+. (You 
may wish to prove 


raya fs Sark (at att) 8 
aay fT Lee )=5 
k=1 k=1 

for alln € N.) 

c) Using the fact that (x7)’ = qx7~! for x > 0 and q € Q (see Exer- 
cise 4.2.7), prove that L(x?) = qL(x) for allg € Qandx > 0. 

d) Prove that L(xy) = L(x) + L(y) for all x, y € (0, oo). 

e) Suppose that e = lim,-,.(1 + 1/n)” exists. (It does—see Exam- 
ple 4.22.) Use l’H6pital’s Rule to show that L(e) = 1. [L(x) is the 
natural logarithm function log x.] 


5.3.8 |. This exercise was used in Section 4.3. Let E = L~!, where L is defined 
in Exercise 5.3.7. 


a) Use the Inverse Function Theorem to show that E is differentiable 
and strictly increasing on R with E’(x)= E(x), E(0)=1, and E(1)=e. 

b) Prove that E(x) > co as x > oo and E(x) > Oas x > —o0. 

c) Prove that E(xq) = (E(x))4 and E(q) = ef for allg € Qandx ER. 

d) Prove that E(x + y) = E(x)E(y) for allx, y ER. 

e) For each a € R define e* = E(a). Let x > 0 and define x* = 
e%!8* :— E(wL(x)). Prove that 0 < x < y implies x* < y® fora > 0 
and x* > y® fora < 0. Also prove that 


1 
OTB & 7% XB GS ane (x?) = ax?! 
x 


for alla, 8 € Randx > 0. 
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5.3.9. Suppose that f : [a,b] — R is continuously differentiable and 1-1 on 
[a, b]. Prove that 


b f(b) 
/ Fx) dx+ / f7'(x) dx = bf (b) — af (a). 
a f(@ 


5.3.10. Suppose that ¢ is C! on [a, b] and f is integrable on [c, d] := @[a, b]. If 
¢’ is never zero on [a, b], prove that f o ¢ is integrable on [a, b]. 

5.3.11. Let g € Q. Suppose that a < b,0 < c < d,and that f : [a,b] > [c, d]. 
If f is integrable on [a, b], then prove that 


([ f4(t) ar) = f% (x) 
for all x € [a, b]. 


5.3.12. For eachn €N, define 


Prove that a, > 4/e. 


5.4 IMPROPER RIEMANN INTEGRATION 


To extend the Riemann integral to unbounded intervals or unbounded func- 
tions, we begin with an elementary observation. 


5.37 Remark. /f fis integrable on [a, b], then 
b d 
= li li : 
/ f(x) dx im, (,iim_ f(x) ax) 
Proof. By Theorem 5.26, 
x 
F(x) = / f(t) dt 
a 
is continuous on [a, b]. Thus 
b 
/ f(x) dx = F(b) — F(a) = lim ( lim (F(d) — F(o))) 
7 cat d>b— 


d 
= in (tg 10049). : 


This leads to the following generalization of the Riemann integral. 
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5.38 Definition. 


Let (a, b) be a nonempty, open (possibly unbounded) interval and f : (a,b) 
>R. 


i) f is said to be locally integrable on (a, b) if and only if f is integrable on 
each closed subinterval [c, d] of (a, b). 

ii) f is said to be improperly integrable on (a, b) if and only if f is locally 
integrable on (a, b) and 


b d 
=i li 18 
: F@)ax sim, (tim_f f(s) dr) (18) 
exists and is finite. This limit is called the improper (Riemann) integral of f 


over (a, b). 


5.39 Remark. The order of the limits in (18) does not matter. In particular, if the 
limit in (18) exists, then 


b d 
fsonse= im (a, 1004) 


Proof. Let xo € (a, b) be fixed. By Theorems 5.20 and 3.8, 


d 
als (,1im_f f(x) dx) = = lim nee f(x) dx + pie mh f(x) ax) 


= sim, [ f(x) dx + inf f(x) dx 


= ii 1 
Ce (im, ‘ f&) ax). r 
Thus we shall use the notation 
1s “fe dx 
caat 
d—>b— 


to represent the limit in (18). If the integral is not improper at one of the 
endpoints—for example, if f is Riemann integrable on closed subintervals of 
(a, b])—we shall say that f is improperly integrable on (a, b] and simplify the 
notation even further by writing 


b b 
/ f(x) dx = tim, [ f(x) dx. 
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The following example shows that an improperly integrable function need not 
be bounded. 


5.40 EXAMPLE. 
Show that f(x) = 1/,/x is improperly integrable on (0, 1]. 


Solution. By definition, 


A aes | 
—dx= li —dx= li 2-2 =a |_| 
[ f/x : slim, [ JX is Sore va) 


The following example shows that a function can be improperly integrable on 
an unbounded interval. 


5.41 EXAMPLE. 
Show that f(x) = 1/x? is improperly integrable on [1, 0<). 


Solution. By definition, 


0° d 1 1 
= dx = ii — dx = li 1—-—]=1. a 
fo eae= aim [pare jin (1-3) 
Because an improper integral is a limit of Riemann integrals, many of the 
results we proved earlier in this chapter have analogues for the improper 


integral. The next two results illustrate this principle. 


5.42 Theorem. I[f f g are improperly integrable on (a,b) and a, B € R, then 
af + Bg is improperly integrable on (a, b) and 


b b b 
[arcy + pew dx =a f poy ax +6 f genax. 
a a a 
Proof. By Theorem 5.19 (the Linear Property for Riemann Integrals), 


d d d 
iy Ge) 2 peGiyde Se / fla) dx + B / sx) de 


for alla < c <d < b. Taking the limit as c — a+ andd — b-— finishes the 
proof. a 


5.43 Theorem. [COMPARISON THEOREM FOR IMPROPER INTEGRALS]. 
Suppose that f, g are locally integrable on (a,b). If0 < f(x) < g(x) forx € 
(a,b), and g is improperly integrable on (a, b), then f is improperly integrable 


on (a, b) and 
b b 
[ fears [ eoar, 
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Proof. Fix c € (a,b). Let F(d) = [@ f(x)dx and G(d) = [4 g(x)dx ford € 
[c, b). By the Comparison Theorem for Integrals, F(d) < G(d). Since f > 0, 
the function F is increasing on [c, b]; hence F(b—) exists (see Theorem 4.18). 
Thus, by definition, f is improperly integrable on (c, b) and 


b b 
/ f(x) dx = F(b—) < G(b—-) -| g(x) dx. 


A similar argument works for the case c > a+. a 


This test is frequently used in conjunction with the following inequalities: 
|sinx| < |x| for all x € R (see Appendix B); for every w > 0 there exists a 
constant By > 1 such that | log x| < x® for all x > By (see Exercise 4.4.6). Here 
are two typical examples. 


5.44 EXAMPLE. 
Prove that f(x) = sinx/V/x3 is improperly integrable on (0, 1]. 


Proof. Since f is continuous on (0, 1], f is locally integrable there as well. 
Since f is nonnegative on (0, 1], it is clear that 0 < f(x) = | sinx /Vx3| < 
|x|/x3/* = 1/./x on (0, 1]. Since this last function is improperly integrable 
on (0, 1] by Example 5.40, it follows from the Comparison Test that f(x) is 
improperly integrable on (0, 1]. a 


5.45 EXAMPLE. 
Prove that f(x) = logx/+/x> is improperly integrable on [1, 00). 


Proof. Since f is continuous on (0,00), f is integrable on [1, C] for any 
C € [1, o«). By Exercise 4.4.6, there is a constant C > 1 such that 0 < f(x) = 
log x/Vx> < x!/?/x°/? = 1/x? for x > C. Since this last function is improp- 
erly integrable on [1, 00) by Example 5.41, it follows from the Comparison 
Theorem that f(x) is improperly integrable on [1, oo). B 


Although improperly integrable functions are not closed under multiplication 
(see Exercise 5.4.5), the Comparison Theorem can be used to show that some 
kinds of products are improperly integrable. 


5.46 Remark. /f fis bounded and locally integrable on (a, b) and |g| is improp- 
erly integrable on (a, b), then | fg| is improperly integrable on (a, b). 


Proof. Let M = supyecao)|f(@)|. Then 0 < |f(x)g(x)| < M|g(x)| for all 
x € (a, b). Hence, by Theorem 5.43, | fg| is improperly integrable on (a, b). Hl 


For the Riemann integral, we proved that |f| is integrable when f is (see 
Theorem 5.22). This is not the case for the improper integral (see Example 5.49 
below). For this reason we introduce the following concepts. 
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5.47 Definition. 
Let (a, b) be a nonempty, open interval and f : (a,b) > R. 


i) f is said to be absolutely integrable on (a, b) if and only if f is locally inte- 
grable and | f| is improperly integrable on (a, b). 

ii) f is said to be conditionally integrable on (a, b) if and only if f is improperly 
integrable but not absolutely integrable on (a, b). 


The following result, an analogue of Theorem 5.22 for absolute integrable 
functions, shows that absolute integrability implies improper integrability. 


5.48 Theorem. /f f is absolutely integrable on (a, b), then f is improperly inte- 
grable on (a, b) and 
b 
[roa 
a 
Proof. Since 0 < |f(x)| + f(x) < 2|f()|, we have by Theorem 5.43 that 


|f| + f is improperly integrable on [a,b]. Hence, by Theorem 5.42, so is 
f =(f|+ Sf) —\f|. Moreover, 


d 
/ f(x) dx 


for every a < c <d < b. We finish the proof by taking the limit of this last 
inequality asc > a+ andd —> b-. a 


b 
< | Lf(@)| dx. 


d 
<| Ifa) dx 


The converse of Theorem 5.48, however, is false. 


5.49 EXAMPLE. 


Prove that the function sin x/x is conditionally integrable on [1, oo). 


Proof. Integrating by parts, we have 


d sinx COSX id 4 cos x 
aan ae fi oF ax 
1 x X 1 Xx 


cosd 4 cosx 
= cos(1) 7 / dx. 
1 


Since 1/x? is absolutely integrable on [1, 00), it follows from Remark 5.46 that 
cos x/x? is absolutely integrable on [1, 00). Therefore, sinx/x is improperly 
integrable on [1, oo) and 


© sin x © cos x 
— dx =cos(l) — 5 dx. 
1 x 1 x 
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To show that sin x/x is not absolutely integrable on [1, oo), notice that 


"™ | sin x| " f*™ — |sinx| 
dx > pe) dx 
| x (k-l)xn * 


k=2 
n kx 
> ye — | sinx| dx 
ty K® Sk 


n n 


2 2 1 
=D 
k=2 k=2 

for eachn EN. Since 


1 k+l] n+l] 
ye) ~dx= | eek = logue Ty lee zie 


as n — ov, it follows from the Squeeze Theorem that 


: nt | sin x| 
lim — dx =o. 
noo Jy x 
Thus, sinx/x is not absolutely integrable on [1, oo). i 


EXERCISES 


5.4.0. Suppose that a < b. Decide which of the following statements are true 
and which are false. Prove the true ones and give counterexamples for 
the false ones. 


a) If f is bounded on [a, b], if g is absolutely integrable on (a, b), and 
if | f(x)| < g(x) for all x € (a,b), then f is absolutely integrable on 
(a, b). 

b) Suppose that h is absolutely integrable on (a, b). If f is continuous on 
(a, b), if g is continuous and never zero on [a, b], and if | f(x)| < A(x) 
for all x € [a, b], then f/g is absolutely integrable on (a, b). 

c) If f : (a,b) — [0, 00) is continuous and absolutely integrable on 
(a, b) for some a, b € R, then ./f is absolutely integrable on (a, b). 

d) If f and g are absolutely integrable on (a,b), then max{f, g} and 
min{ f, g} are both absolutely integrable on (a, b). 


5.4.1. Evaluate the following improper integrals. 


°C4-xX 
a) 5 ax 
1 Xx 


9 4 
b) / ape dk 
—0o 


5.4.2. 


5.4.3. 


5.4.4. 


5.4.5. 


5.4.6. 


5.4.7. 
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= 3sinx 
c) 


Tease © 
1 

d) if x" log x dx, a € (0, 00) 
0 


For each of the following, find all values of p € R for which f is improp- 
erly integrable on J. 


a) f(x) =1/x?, I = (1,00) 

b) f(x) =1/x?, 1=(,1) 

c) f(x) = 1/(@ log? x), I = (e, 00) 

d) f(x) =1/d4+x?), I = (0, oo) 

e) f(x) = log’ x/x?, where a > Ois fixed, and J = (1, oo) 


Let p > 0. Show that sinx/x? is improperly integrable on [1, oo) and 
that cos x/ log? x is improperly integrable on [e, oo). 
Decide which of the following functions are improperly integrable on I. 


a) f(x) =cosx, I = (0, oo) 

b) f(x) =1/x3, 1 =[-1,1] 

c) f(x) =x7*sin(x~?), I = (1, 00) 

d) f(x) =log(cosx), J = (0,1) 

e) f(x) = (2cosx — 2)/x?, I = (0, 0) 


Use the examples provided by Exercise 5.4.2b to show that the product 
of two improperly integrable functions might not be improperly inte- 
grable. 

Suppose that f,g are nonnegative and locally integrable on [a, b) 
and that 


L:= lim F@) 
x>b— g(x) 


exists as an extended real number. 


a) Show that if 0 < L < co and g is improperly integrable on [a, b), then 
so is f. 

b) Show that if0 < L < co and g is not improperly integrable on [a, b), 
then neither is f. 


a) Suppose that f is improperly integrable on [0, oo). Prove that if L = 


limy so f(x) exists, then L = 0. 
b) Let 


n<x<n+2",neN 
otherwise. 


f@)= 


Prove that f is improperly integrable on [0, 00) but limy— oo f(x) 
does not exist. 
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5.4.8. Prove that if f is absolutely integrable on [1, oo), then 
CO 
lim f(x") dx =0. 
n> Co 1 


5.4.9. Assuming e = lim;—.o0 )-y~9 1/k! (see Example 7.45), prove that 


1 oo) 
lim (= | x"e* ax) = 1. 
n>oo \ n! I 


‘ie ea sinx dx < 2 
0 ~~ a 
for alla > 0. 


b) What happens if cos x replaces sin x? 


5.4.10. a) Prove that 


*5.5 FUNCTIONS OF BOUNDED VARIATION 


This section uses no material from any other enrichment section. 

In this section we study functions which do not wiggle too much. These 
functions, which play a prominent role in the theory of Fourier series (see 
Sections *14.3 and *14.4) and probability theory, are important tools for 
theoretical as well as applied mathematics. 

Let ¢ : [a, b] > R. To measure how much ¢ wiggles on an interval [a, b], set 


Vo, P) = >> oj) — Oj-0) 


j=l 
for each partition P = {xo, x1,..., Xn} of [a, b]. The variation of ¢ is defined by 
Var(@) := sup{V(¢, P) : P is a partition of [a, b]}. (19) 


5.50 Definition. 


Let [a, b] be a closed, nondegenerate interval and ¢ : [a,b] > R. Then ¢ is 
said to be of bounded variation on [a, b] if and only if Var(@) < oo. 


The following three remarks show how the collection of functions of bounded 
variation is related to other collections of functions we have studied. 


5.51 Remark. If ¢ € C'[a, b], then ¢ is of bounded variation on [a, b]. However, 
there exist functions of bounded variation which are not continuously differen- 
tiable. 


Proof. Let P = {xo, x1, ...,X»,} be a partition of [a, b]. By the Extreme Value 
Theorem, there isan M > Osuch that |¢’(x)| < M for all x € [a, b]. Therefore, 
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it follows from the Mean Value Theorem that for each k between 1 and n there 
is a point cy, between x,_; and x, such that 


|p (xe) — @xe-1)| = 10 (ce) (XR — Xe-1) < M(xg — xe-1). 


By telescoping, we obtain V(¢@, P) < M(b —a) for any partition P of [a, b]. 
Therefore, 


Var(b) < M(b —a). 


On the other hand, x? sin(1/x) is of bounded variation on [0, 1] (see Exer- 
cise 5.5.2) but does not belong to C![0, 1] (see Example 4.8). a 


5.52 Remark. [fd is monotone on [a, b], then ¢ is of bounded variation on [a, b]. 
However, there exist functions of bounded variation which are not monotone. 


Proof. Let ¢ be increasing on [a, b] and P = {xo, x1, ..., Xn} be a partition of 
[a, b]. Then, by telescoping, 


Yo Io) — 6-1 = DO@@i) — o)-1) 
j=l j=l 


= b(n) — (x0) = (6) — O(a) =: M < =. 


Thus, Var(f) = M. On the other hand, by Remark 5.51, ¢(x) = x? is of 
bounded variation on [—1, 1]. 


5.53 Remark. If ¢ is of bounded variation on [a, b], then ¢ is bounded on [a, b]. 
However, there exist bounded functions which are not of bounded variation. 


Proof. Let x € [a, b] and note by definition that 


Ip(x) — $(@)| < 16) — 6(@)| + 16) — @@)| < Var@). 
Hence, by the Triangle Inequality, 


|p (x)| < |p(a)| + Var(@). 


To find a bounded function which is not of bounded variation, consider 


__ jsind/x) x £0 


Clearly, ¢ is bounded by 1. On the other hand, if 


0 jJ=0 
ad (ee ore | 
(n-—jtl)x 
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then 
n 
Yo 16 (xj) — 6@j-1)| = 2 > 00 
j=l 
as n — oo. Thus ¢ is not of bounded variation on [0, 2/7]. |_| 


The following result and Exercise 5.5.3 are partial answers to the question, Is 
the class of functions of bounded variation preserved by algebraic operations? 


5.54 Theorem. [f ¢ and are of bounded variation on a closed interval [a, b], 
then so ared+w and o —w. 


Proof. Leta = x9 < x1 < ++: <x, =b. Then 


oo ldap) twa) — @@j-1) + VOj-1)| 


j=l 
< 5 l6@) - 6@)-DI+ 6 WO) - ¥O;-DI 
j=l j=l 
< Var(@) + Var(w). 
Therefore, Var(@ + yy) < Var(@) + Var(w). a 


It turns out that there is a close connection between functions of bounded 
variation and monotone functions (see Corollary 5.57 below). To make this 
connection clear, we introduce the following concept. 


5.55 Definition. 


Let ¢ be of bounded variation on a closed interval [a, b]. The total variation of 
¢ is the function @ defined on [a, b] by 


k 
P(x) := sup ys Ip(xj) — O(xj-1) |: {X0, 41, .-., xe} iS a partition of [a, x] 
j=l 


5.56 Theorem. Let ¢ be of bounded variation on [a, b| and © be its total varia- 
tion. Then 


i) |d(y) — o(x)| < @(y) — G(x) foralla<x<y<b, 
ii) ® and ® — ¢ are increasing on [a, b], and 
iti) Var(¢) < Var(®). 
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Proof. i) Let x < y belong to [a,b] and {xo, x1,...,x¢} be a partition of 
[a,x]. Then {xo, x1,..., xx, y} is a partition of [a, y], and we have by Defi- 
nition 5.55 that 


k k 


S> le) — o@j-DI < D616) — o@j-DI F160) — 6) < G0). 
j=l j=l 
Taking the supremum of this inequality over all partitions {xo, x1,..., xx} of 


[a, x], we obtain 


P(x) < P(x) + |o(y) — 6@)| S$ PO). 


ii) By the Monotone Property of Suprema, @ is increasing on [a,b]. To 
show that @ — ¢ also increases, suppose that a < x < y < b. By parti), 


b(y) — Oo) S 1O(y) — @)| Ss O(y) — P(X). 


Therefore, (x) — d(x) < ®(y) — (9). 
iii) Let P = {xo, x1,..., 2X} be a partition of [a, b]. By part i) and Defini- 
tion 5.50, 


S-lb(xj) — b0j—-D) < Y5 |G (xj) — O(xj-1)] < Var). 


j=l j=l 


Taking the supremum of this inequality over all partitions P of [a,b], we 
obtain Var(¢) < Var(®). 


5.57 Corollary. Let [a, b] be a closed interval. Then ¢ is of bounded variation on 
[a, b] if and only if there exist increasing functions f, g on [a, b] such that 


p(x) = f(x) — ga), x € [a, b]. 


Proof. Suppose that ¢ is of bounded variation and let ® represent the total 
variation of ¢. Set f = ® and g = @ — @¢. By Theorem 5.56, f and g are 
increasing, and by construction, @ = f — g. 

Conversely, suppose that ¢ = f — g for some increasing f, g on [a, b]. Then 
by Remark 5.52 and Theorem 5.54, ¢ is of bounded variation on [a, b]. B 


In particular, if f is of bounded variation on [a, b], then 


i) f(x+) exists for each x € [a, b) and f(x—) exists for each x € (a, b] (see 
Theorem 4.18), 
ii) f has no more than countably many points of discontinuity in [a, b] (see 
Theorem 4.19), and 
iii) f is integrable on [a, b] (see Exercise 5.1.8). 
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EXERCISES 


5.5.1. a) Show that 4k/(4k? — 1) > 1/k fork EN. 
b) Prove that 


ei ft 
yi o> / — dx = log(2") 
kel k 1 Xx 


for alln EN. 
c) Prove that 


o(x) = 


is not of bounded variation on [0, 1]. 


5.5.2. a) Show that (8k* + 2)/(4k? — 1)* < 1/k? fork =2,3,.... 
b) Prove that 


forn EN. 
c) Prove that 


Deen’ 
x* sin — x £0 
x 


x=0 


eax)= 


is of bounded variation on [0, 1]. 


5.5.3|. This exercise is used in Section *14.3. Suppose that ¢ and yw are of 
bounded variation on a closed interval [a, b]. 


a) Prove that w¢ is of bounded variation on [a, b] for every a € R. 
b) Prove that dw is of bounded variation on [a, b]. 
c) If there is an ¢9 > 0 such that 


p(x) = £0, x € [a,b], 


prove that 1/¢ is of bounded variation on [a, b]. 


5.5.4. Suppose that ¢ is of bounded variation on a closed, bounded interval 
[a, b]. Prove that ¢ is continuous on (a, b) if and only if ¢ is uniformly 
continuous on (a, D). 

5.5.5. a) If ¢ is continuous on a closed nondegenerate interval [a, b], differ- 
entiable on (a, b), and if ¢’ is bounded on (a, b), prove that ¢ is of 
bounded variation on [a, b]. 

b) Show that ¢(x) = </x is of bounded variation on [—1, 1] but ¢’ is 
unbounded on (—1, 1). 
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5.5.6. Let P be a polynomial of degree N. 


a) Show that P is of bounded variation on any closed interval [a, b]. 
b) Obtain an estimate for Var(P) on [a, b], using values of the derivative 
P'(x) at no more than N points. 


5.5.7. Let ¢@ be a function of bounded variation on [a, b] and @ be its total 
variation function. Prove that if ® is continuous at some point xo € 
(a, b), then ¢ is continuous at xo. 

5.5.8, This exercise is used in Section *14.4. If f is integrable on [a, dD], 
prove that 


F(x) = ib f(t) dt 


is of bounded variation on [a, b]. 
5.5.9. Suppose that f’ exists and is integrable on [a,b]. Prove that f is of 
bounded variation and 


b 
Var(f) = / Lf’ @ol de. 


If f’ is bounded rather than integrable, how do the upper and lower 
integrals of f’ compare to the variation of f? 


*5.6 CONVEX FUNCTIONS 


The last two results of this section use enrichment Theorems 4.19 and 4.23. 

In this section we examine another collection of functions which is impor- 
tant for certain applications, especially for Fourier analysis, functional analysis, 
numerical analysis, and probability theory. 


5.58 Definition. 
Let J be an interval and f:/ > R. 


i) f is said to be convex on J if and only if 


flax +(1—-a)y) <af(x) +d —-a) f(y) 


forall0 <a <1landallx,yel. 
ii) f is said to be concave on J if and only if — f is convex on /. 


Notice that, by definition, a function f is convex on an interval J if and only 
if f is convex on every closed subinterval of J. 

It is easy to check that f(x) = mx + b is both convex and concave on any 
interval (see also Exercise 5.6.3) but in general it is difficult to apply Defini- 
tion 5.58 directly. For this reason, we include the following simple geometric 
characterizations of convexity. 
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FIGURE 5.5 


5.59 Remark. Let I be an interval and f : I > R. Then f is convex on I if and 
only if given any [c,d] C I, the chord through the points (c, f(c)), (d, f (d)) lies 
on or above the graph of y = f (x) for all x € [c, d]. (See Figure 5.5.) 


Proof. Suppose that f is convex on J and that xo € [c, d]. Choose 0 < a < 1 
such that x9 = ac + (1 — a)d. The chord from (c, f(c)) to (d, f(d)) has slope 
(f(d) — f(c))/(d — c). Hence, the point on this chord which has the form 
(xo, yo) must satisfy vp = af(c) + (1 — a) f(d). Since f is convex, it follows 
that f(xo) < yo; that is, the point (x, yo) lies on or above the point (x9, f (x)). 
A similar argument establishes the reverse implication. a 


Thus both f(x) = |x| and f(x) = x? are convex on any interval. 


5.60 Remark. A function f is convex on a nonempty, open interval (a, b) if and 
only if the slope of the chord always increases on (a, b); that is, if and only if 


fo)- fl) _ f@)~ fe) 


a<c<x<d<b_ implies 
x—Cc d—-x 


Proof. Fix a <c <x <d < band let A(x) be the equation of the chord to 
f through the points (c, f(c)) and (d, f(d)). If f is convex, then f(x) < A(x) 
(see Figure 5.6). Therefore, 


fe) flO) _ A@)-AO) _ A) - AB) _ f@M=f@) 


x—-—C x—C d—-x a d-x 
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y=f(x) 


_—— “a 
~ y=A) 
| 
| 
ye Chords~ | 
| 
| | 
| | 
| | 
| | | 
| | | 
c a d x 
FIGURE 5.6 


Conversely, if f is not convex, then A(x) < f(x) for some x € (c, d). It follows 
that 


fO@)-~FO | AG)—-AO) _A@M—Aw) _ F/O — FE) 


x—C x—C d-x d—-x 


Therefore, the slope of the chord does not increase on (a, D). H 
This leads us to a characterization of differentiable convex functions. 


5.61 Theorem. Suppose that f is differentiable on a nonempty, open interval I. 
Then f is convex on I if and only if f’ is increasing on I. 


Proof. Suppose that f is convex on J =: (a,b) and that c,d € (a, b) satisfy 
c < d. Choose h > 0 so small thatc +h < dandd+h < b. Then by 
Remark 5.60, 


f(e+M-fO — fd+h)—~f@ 
h ~ h ; 
In particular, f’(c) < f’(d). 
Conversely, suppose that f’ is increasing on (a,b). Leta <c<x<d<b 


and use the Mean Value Theorem to choose xo (between c and x) and x; 
(between x and d) such that 


f@)-fO _ 


ASC 


=f Ga): 


d)— 
fio) and LO=FO) 
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Since x9 < x1, it follows that f’(xo) < f’(x1). In particular, we conclude by 
Remark 5.60 that f is convex on (a, b). a 


Combining Theorems 4.17 and 5.61, we obtain the usual convexity criterion 
in terms of the second derivative: If f is twice differentiable on (a, b), then f 
is convex on (a,b) if and only if f”(x) > 0 for all x € (a,b). In particular, 
convexity is what elementary calculus texts call concave upward and concavity 
is what elementary calculus texts call concave downward. 

On open intervals, convex functions are always continuous. (The statements 
and proofs of the next two results come from Zygmund [15].) 


5.62 Theorem. /f f is convex on some nonempty, open interval I, then f is con- 
tinuous on I. 


Proof. Let x9 € I =: (a,b). By symmetry, it suffices to show that f(x) > 
f(xo) as x > xpt+. Leta <c < x9 < x <d <b, y = g(x) represent the 
equation of the chord through (c, f(c)), (xo, f(xo)), and y = A(x) represent 
the equation of the chord through (xo, f(xo)), (d, f(d)). Since f is convex, we 
have by Remark 5.59 that f(x) < h(x). Since f (xo) lies on or below the chord 
from (c, f(c)) to (x, f(x)), we also have that g(x) < f(x). Consequently, 


g(x) < f(x) < h@), x € (x0, db). 


Both chords y = g(x) and y = h(x) pass through the point (xo, f(%0)), so 
g(x) > f(xo) and h(x) > f(xo) as x > xo+. Hence, it follows from the 
Squeeze Theorem that f(x) > f (xo) as x > xo+. A 


Theorem 5.62 does not hold for closed intervals [a, b]. Indeed, the function 


0 O<x<l 
f(x) = tL. eet 
is convex on [0, 1] but not continuous there. 

A function f is said to have a proper maximum (respectively, proper mini- 
mum) at xo if and only if there exists a 6 > 0 such that f(x) < f(xo) [respec- 
tively, f(x) > f(xo)] for all 0 < |x — xo] < 5. As far as proper extrema are 
concerned, convex functions behave like strictly increasing functions. 


5.63 Theorem. 


i) If fis convex on a nonempty, open interval (a, b), then f has no proper maxi- 
mum on (a, b). 

ii) If f is convex on [0, 00) and has a proper minimum, then f(x) — oo as 
x > OO. 


Proof. i) Suppose that x9 € (a,b) and that f(xq) is a proper maximum of 
f. Then there exist c < x9 < d such that f(x) < f(xo) forc < x <d. In 
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particular, the chord through (c, f(c)), (d, f(d)) must lie below f (xo) for c, d 
near xo, a contradiction. 

ii) Suppose that xo € (a,b) and that f(xo) is a proper minimum of f. Fix 
x1 > xo. Let y = g(x) represent the equation of the chord through (x9, f (x0)) 
and (x1, f(x1)). Since f(xo) is a proper minimum, f(x) > f (xo); hence, g 
has positive slope. Moreover, by the proof of Theorem 5.62, g(x) < f(x) for 
all x € (x1, 00). Since g(x) > co as x > oo, we conclude that f(x) > oo as 
x > OO. a 


Another important result about convex functions addresses the question, 
What happens when we interchange the order of a convex function and an inte- 
gral sign? 


5.64 Theorem. [JENSEN’S INEQUALITY]. 
Let @ be convex on a closed interval [a, b] and f : [0,1] — [a, b]. Iffand go f 
are integrable on (0, 1], then 


1 1 
o(| fea] <| (po f)(x) dx. (20) 


Proof. Set 


1 
al f(x) dx 
0 


and observe that 


1 1 
o(| roar) =o00+s(f fosyax—e] (21) 


for alls € R. (Note: Since a < f(x) < b for each x ¢€ [0, 1], c must belong 
to the interval [a, b] by the Comparison Theorem for Integrals. Thus ¢(c) is 
defined.) 

Let 


_ $(c) — b(x) 
s= sup ————. 
x€[a,c) cx 


By Remark 5.60, s < (¢(u) — ¢(c))/(u —c) for all u € (c, b]; that is, 
b(c) + s(u—c) < du) (22) 


for all u € [c, b]. On the other hand, if u € [a, c), we have by the definition of 
s that 


& P(e) — GH) 


c—u 
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Thus (22) holds for all u € [a, b]. Applying (22) to u = f(x), we obtain 
p(c) +s(f(x) —c) < bo fy). 


Integrating this inequality as x runs from 0 to 1, we obtain 


1 1 
wo+s(| fos) ax~e) <[ (po f)(x) dx. 


Combining this inequality with (21), we conclude that (20) holds. a 


What about differentiability of convex functions? To answer this question we 
introduce the following concepts (compare with Definition 4.6). 


5.65 Definition. 
Let f : (a,b) > Rand x € (a, D). 


i) f is said to have a right-hand derivative at x if and only if 


h) — 


exists as an extended real number. 
ii) f is said to have a left-hand derivative at x if and only if 


Disc) := tim LEAD LO 


exists as an extended real number. 


The following result is a simple consequence of the definition of differen- 
tiability and the characterization of two-sided limits by one-sided limits (see 
Theorem 3.14). 


5.66 Remark. A real function f is differentiable at x if and only if both Dr f (x) 
and D_ f (x) exist, are finite, and are equal, in which case f'(x) = Drf(x) = 


Dif (x). 


The next result shows that the left-hand and right-hand derivatives of a con- 
vex function are remarkably well-behaved. 


5.67 Theorem. Let f be convex on an open interval (a, b). Then the left-hand 
and right-hand derivatives of f exist, are increasing on (a, b), and satisfy 


—oo < Dy f(x) < Drf (x) < 


for all x € (a, b). 
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Proof. Let h < 0 and notice that the slope of the chord through the points 
(x, f(x)) and (x +h, f(x +A)) is (f & +h) — f(x))/h. By Remark 5.60, these 
slopes increase as h — O-—. Since increasing functions have a limit (which 
may be +0o0), it follows that Dz; f(x) exists and satisfies —oo < D; f(x) < &. 
Similarly, Dr f (x) exists and satisfies —co < Dr f(x) < co. Remark 5.60 also 
implies 


Dy f (x) < Drf (x). (23) 


Hence, both numbers are finite, and by symmetry it remains to show that 
Dr f (x) is increasing on (a, D). 
Let xj <u <t < x2 be points which belong to (a, b). Then 


fu) = fa) _ FG) — FO 


u—xX{ x2 —-T 


Taking the limit of this inequality as u > x;+ and t — x2—, we conclude by 
(23) that 


Drf (x1) < Dif (x2) < Dr f (x2). (24) 
| 


The next proof uses enrichment Theorem 4.19. 


*5.68 Corollary. If f is convex on an open interval (a, b), then f is differentiable 
at all but countably many points of (a, b); that is, there is an at most countable set 
E C (a,b) such that f'(x) exists for all x € (a, b) \ E. 


Proof. Let E be the set where either D; f(x) or Dr f(x) is discontinuous. 
By Theorems 5.67 and 4.19, the set E is at most countable. Suppose that 
xo € (a,b) \ E and that x < xo. By (24), 


Drf (x) < Dif (xo) < Drf (xo). 


Let x > xo. Since both Dz; f(x) and Dr f (x) are continuous at x9, we obtain 
Drf (xo) < Dif (xo) < Drf (xo). In particular, f’(xo) exists for all x9 € 
(a,b) \ E. | 


How useful is a statement about f’(x) which holds for all but countably many 
points x? We address this question by proving a generalization of Theorem 4.17. 
(The proof here uses enrichment Theorem 4.23.) 


*5.69 Theorem. Suppose that f is continuous on a closed interval [a, b] and dif- 
ferentiable on (a,b). If f'(x) = 0 for all but countably many x € (a, b), then f is 
increasing on [a, b]. 


Proof. Suppose that f’(x,) < 0 for some x; € (a, b) and let y € (f’(x1), 0). By 
Theorem 4.23 (the Intermediate Value Theorem for derivatives), there is an 
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x = x(y) € (a, b) such that f’(x) = y < 0. It follows that if f’(x) < 0 for one 
x € (a,b), then f’(x) < 0 for uncountably many x € (a, b), a contradiction. 
Therefore, f’(x) > 0 for all x € (a, b); hence, by Theorem 4.17, f is increasing 
on (a, D). a 


*5.70 Corollary. If f is continuous on a closed interval [a, b] and differentiable 
on (a,b) with f'(x) = 0 for all but countably many x € (a, b), then f is constant 
on [a, b]. 


EXERCISES 


5.6.1. 


5.6.2. 


5.6.3. 


5.6.4. 


5.6.5. 


5.6.6. 


5.6.7. 


Suppose that f, g are convex on an interval J. Prove that f + g and cf 
are convex on / for any c > 0. 

Suppose that f, is a sequence of functions convex on an interval / 
and that 


f(x) = lim, fa) 


exists for each x € J. Prove that f is convex on I. 

Prove that a function f is both convex and concave on / if and only if 
there exist m, b € Rsuch that f(x) =mx +b forx € J. 

Prove that f(x) = x? is convex on [0, co) for p > 1, and concave on 
[0, co) forO < p <1. 

Show that if f is increasing on [a, b], then 


F(x) = 3 f(t)dt 


is convex on [a,b]. (Recall that by Exercise 5.1.8, f is integrable on 
[a, b].) 
If f : [a,b] — Ris integrable on [a, b], prove that 


b b 1/2 
[itera s 0-01? (f 2) as) . 


Suppose that f : [0, 1] — [a, b] is integrable on [0, 1]. Assume that ef 
and | f (x)|? are integrable for all 0 < p < oo (see Exercise 12.2.11). 


a) Prove that 


D 1 Ae 1 
eh to) dr < f eJ™ dx and (/ Yor a] <[ | f(x)| dx 
0 0 0 


forallO <r <1. 
b) If0 < p <q, prove that 


1 1/p 1 1/q 
(/ lf)? ix) ns (/ | f(x)|4 ix) 
0 0 


c) State and prove analogues of these results for improper integrals. 
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*5.6.8. Let f be continuous on a closed, bounded interval [a, b] and suppose 
that Dr f (x) exists for all x € (a, b). 


a) Show that if f(b) < yo < f(a), then 
xo := sup{x € [a,b]: f(x) > yo} 


satisfies f (xo) = yo and Dr f (xo) < 0. 

b) Prove that if f(b) < f(a), then there are uncountably many points x 
which satisfy Dr f(x) < 0. 

c) Prove that if Dr f(x) > 0 for all but countably many points x € (a, b), 
then f is increasing on [a, b]. 

d) Prove that if De f(x) => 0 and g(x) = f(x) + x/n for some n € N, 
then Drg(x) > 0. 

e) Prove that if Dr f(x) => 0 for all but countably many points x € (a, b), 
then f is increasing on [a, b]. 


CHAPTER 6 


Infinite Series of Real Numbers 


Infinite series are one of the most widely used tools of analysis. They are used 
to approximate numbers and functions. (Series of Ramanujan type have been 
used to compute billions of digits of the decimal expansion of 2.) They are 
used to approximate solutions of differential equations. (You may have used 
power series to solve ordinary differential equations with nonconstant coeffi- 
cients.) They even form the basis for some very practical applications, including 
pattern recognition (e.g., reading zip codes), image enhancement (e.g., removing 
raindrop clutter from a radar scan), and data compression (e.g., transmission of 
hundreds of TV programs through a single, photonic, fiber optic cable). Other 
applications of infinite series can be found in Section 7.5. In view of the variety 
of these applications, it should come as no surprise that the subject matter of 
this chapter (and the next) is of fundamental importance. 


6.1 INTRODUCTION 


Let {ax},en be a sequence of numbers. We shall call an expression of the form 


ya (1) 


an infinite series with terms az. (No convergence is assumed at this point. This is 
merely a formal expression.) 


Il 
a 


6.1 Definition. 
Let S = ‘aa , %& be an infinite series with terms ax. 


i) For eachn EN, the partial sum of S of order n is defined by 


n 
way a: 
k=1 


ii) S is said to converge if and only if its sequence of partial sums {s,,} con- 
verges to some s € Rasn —> ox; that is, if and only if for every e > 0 there 
isan N € Nsuch that n > N implies |s, — s| < e. In this case we shall write 
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Yas (2) 
k=1 


and call s the sum, or value, of the series )77~ | ax. 

iii) S is said to diverge if and only if its sequence of partial sums {s,} does 
not converge as n — oo. When s, diverges to +oo as n — oo, we shall 
also write 


Ce 
yo 
k=1 


(We shall deal with series of functions in Chapter 7.) 

You are already familiar with one type of infinite series, decimal expan- 
sions. Every decimal expansion of a number x € (0, 1) is a series of the form 
Ye x«/10*, where the x,’s are integers in [0, 9]. For example, when we write 
1/3 = 0.333... we mean 


{, se 
3 = Lig 


In particular, the partial sums 0.3, 0.33, 0.333,...are approximations to 1/3 
which get closer and closer to 1/3 as more terms of the decimal expansion 
are taken. 

One way to determine if a given series converges is to find a formula for its 
partial sums simple enough so that we can decide whether or not they converge. 
Here are two examples. 


6.2 EXAMPLE. 
Prove that )77°., 2-* = 1. 


Proof. By induction, we can show that the partial sums s, = )-7_, 1/2* satisfy 
S, =1—2°-" forn EN. Thus s, > lasn > o. | 


6.3 EXAMPLE. 
Prove that }-?° ,(—)* diverges. 


Proof. The partial sums s, = )-{_,(—D* satisfy 


—1 if n is odd 


So Sees 
0 if n is even. 
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Thus s, does not converge as n > ov. | 
Another way to show that a series diverges is to estimate its partial sums. 


6.4 EXAMPLE, [THE HARMONIC SERIES]. 


Prove that the sequence 1/k converges but the series }°7°., 1/k diverges to +00. 


Proof. The sequence 1/k converges to zero (by Example 2.21). On the other 
hand, by the Comparison Theorem for Integrals, 


1 n k+l y n+l] 
“= |, vax = f _ ae = log + 1). 


We conclude that s, - oo asn > oo. a 


This example shows that the terms of a divergent series may converge. In 
particular, a series does not converge just because its terms converge. On the 
other hand, the following result shows that a series cannot converge if its terms 
do not converge to zero. 


6.5 Theorem. [DIVERGENCE TEST]. 
Let {ag}zen be a sequence of real numbers. If ay, does not converge to zero, then 
the series )-?~_ ax diverges. 


Proof. Suppose to the contrary that }°7°, a, converges to some s € R. By 
definition, the sequence of partial sums s, := )°/_) ax converges to 5 as 
n — oo. Therefore, ay = sy — sp) > 5 —8S =Oask > co, a contradiction. 


The proof of this result establishes a property interesting in its own right: If 
ye ak converges, then a, — 0 as k — oo. It is important to realize from the 
beginning that the converse of this statement is false; that is, Theorem 6.5 is a 
test for divergence, not a test for convergence. Indeed, the harmonic series is a 
divergent series whose terms converge to zero. 

Finding the sum of a convergent series is usually difficult. The following two 
results show that this is not the case for two special kinds of series. 


6.6 Theorem. [TELESCOPIC SERIES]. 
If {ax} is a convergent real sequence, then 


Co 


Dok = ae41) = a1 — im ag. 
= k>oo 
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Proof. By telescoping, we have 


n 
Sni= Sax — 4k41) = a — anti. 
k=1 


Hence, s, — a; — limp_so9 ay aSn > OO. | 
It’s clear how to modify Definition 6.1 to accommodate series that start at 


some index other than k = 1. We use this concept in the following very impor- 
tant result. 


6.7 Theorem. [GEOMETRIC SERIES]. 


Suppose that x € R, that N € {0,1,...}, and that 0° is interpreted to be 1. Then 
the series S-°°_y x* converges if and only if |x| < 1, in which case 


oo N 
XxX 

) x* = 1 . 
—xX 

k=N 


In particular, 
Co 
1 
2 e TCO <1. 
k=0 = 


Proof. If |x| > 1, then 7° y x* diverges by the Divergence Test. If |x| < 1, 
then set s, = )-7_, x* and observe by telescoping that 


(1 =*)s,= 0 = Gtx? bee $ x”) 


eh ey ge ee eg he pS rk, 


Hence, 


for alln € N. Since x”*! — 0 asn — oo for all |x| < 1 (see Example 2.20), 
we conclude that s, > x/(1—.x) asn > o. 
For general N, we may suppose that |x| < | and x 4 0. Hence, 


n n—N+1 
a ae ae a xk. 
k=N k=1 


Hence, it follows from Definition 6.1 and what we’ve already proved that 
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In everyday speech, the words sequence and series are considered synonyms. 
Example 6.4 shows that in mathematics, this is not the case. In particular, you 
must not apply a result valid for sequences to series and vice versa. Nevertheless, 
because convergence of an infinite series is defined in terms of convergence of 
its sequence of partial sums, any result about sequences contains a result about 
infinite series. The following three theorems illustrate this principle. 


6.8 Theorem. [THE CAUCHY CRITERION]. 
Let {ax} be a real sequence. Then the infinite series )~y~ , ax converges if and 
only if for every ¢ > 0 there is an N € N such that 


m 
m>n>N_ imply Soak <€. 
k=n 


Proof. Let s, represent the sequence of partial sums of )°?°, ag and set 
so =0. By Cauchy’s Theorem (Theorem 2.29), s, converges if and only if 
given e > 0 there isan N € Nsuch that m,n > N imply |s,, — s»_1| < €. Since 


m 
Sm — Sn-1 = So ax 
k=n 


for all integers m > n > 1, the proof is complete. a 


6.9 Corollary. Let {ax} be a real sequence. Then the infinite series )- 7, ax con- 
verges if and only if given ¢ > 0 there is an N € N such that 


oo 
doa 
k=n 


6.10 Theorem. Let {ax} and {bx} be real sequences. If )-7°., ax and 7, by are 
convergent series, then 


n>WN_ implies <€. 


Co Ce Ce 

Yi ax + bi) = Doan + Dobe 

k=1 k=1 k=1 
and 


Co lo.e) 
Du (aan) =a) ak 
k=1 k=1 


foranya eR. 


Proof. Both identities are corollaries of Theorem 2.12; we provide the details 
only for the first identity. 
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Let s, represent the partial sums of }°7°, ax and t, represent the partial 
sums of )°?-., bg. Since real addition is commutative, we have 


n 


YG + bk) = Sn + tn, neN. 
k=1 


Taking the limit of this identity as n — oo, we conclude by Theorem 2.12 that 


CO CO CO 
Dia + be) = Tim sy + lim ty = Yay + De. 7 
k=1 k=1 k=1 


EXERCISES 


6.1.0. Let {a,} and {b,} be real sequences. Decide which of the following state- 
ments are true and which are false. Prove the true ones and give coun- 
terexamples to the false ones. 


a) If ag is strictly decreasing and a, — 0 ask > ov, then )°7°, ag con- 
verges. 

b) If a, 4 by for all k € N and if > (ae + bx) converges, then either 
yr ak converges or 7?~. | by converges. 

c) Suppose that S772. (ax + bx) converges. Then }°7°, ax converges if 
and only if }°7°, bg converges. 

d) Ifa, > aask > o, then 


CO 
Sax — dk42) = ay + a2 — 2a. 
k=1 


6.1.1. Prove that each of the following series converges and find its value. 


foe) ( 1)7*-! 
a) & ore 
lore) ( rye 
b 
2 oe 
oe) 3k-3 
9 & Sit 
love) ok+1 = ( Phys 
x 2 ate 
6.1.2. Represent each of the following series as a telescopic series and find 
its value. 
oo 6-2 
) ek 
Co = 
b) ° 


cay (kK +:1) (kK +2) 
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6.1.3. 


6.1.4. 


6.1.5. 


6.1.6. 


6.1.7. 


Infinite Series of Real Numbers 


ae (k + 1) =) 
] we Oe eae 
9 & = ( (k + 2)? 


oo 3 3 3 3 
d 4si 
) 2 sn(; ai et 


Prove that each of the following series diverges. 


Let ao, a1, ... be a sequence of real numbers. If a, — L as k > oo, does 


CO 
YS (@ei — Zax + ae—1) 
k=l 


converge? If so, what is its value? 
Find all x € R for which 


oe Lay xk at) 
k=1 


converges. For each such x, find the value of this series. 


a) Prove that if )°?° , ax converges, then its partial sums s, are bounded. 
b) Show that the converse of part a) is false. Namely, show that a series 
ye ae may have bounded partial sums and still diverge. 


Suppose that / is a closed interval and xg € 7. Suppose further that f is 
differentiable on R, that f’(a) 4 0 for some a € R, that the function 


_ f@) 
f'(@) 

satisfies F(J) C J, and that there is a number 0 < r < 1 such that 

f'@)/f'@ ¢fl—r, 1) for all x € J. 

a) Prove that |F(x) — F(y)| <r|x — y| for allx, y € J. 

b) If xp := F(m-_1) forn €N, prove that |x,41 —%n| < r”|x1 — xo| for all 


neNn. 
c) If x, = Xn-1 — f (Xn_-1)/f' (a) forn €N, prove that 


xeER 


F(x) :=x 


b:= lim x 
n—->Co 


exists, belongs to J, and is a root of f; that is, that f(b) = 0. 


6.1.8. 


6.1.9. 


6.1.10. 


6.1.11. 
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a) Suppose that {az} is a decreasing sequence of real numbers. Prove 
that if )°7°, ax converges, then ka; > 0 as k > ov. 

b) Let sy = St_,)(-1)"!/k for n € N. Prove that 52, is strictly increas- 
iINg, 52n+1 is strictly decreasing, and 52,41 — 52, > Oasn > oo. 

c) Prove that part a) is false if decreasing is removed. 


Let {b,} be a real sequence andb € R. 


a) Suppose that there are M, N € Nsuch that |b—b,;| < M forallk > N. 
Prove that 


N 
< ol —b| + Ma —N) 
k=1 


nb — Sh 


k=1 


for alln > N. 
b) Prove that if b, > bask — ov, then 


by +by +--+ + bn 
n 


>b 


asn —> oOo. 
c) Show that the converse of b) is false. 


A series )-7~-9 ax is said to be Cesdro summable to an L € Rif and only if 


converges to L asn — ov. 


a) Let s, = 729 ax. Prove that 


for eachn EN. 

b) Prove that if a, € Rand )° ya, = L converges, then )°7° 5 ax is 
Cesaro summable to L. 

c) Prove that }°72.9(—1)* is Cesaro summable to 1/2; hence the converse 
of b) is false. 

d) [TAuBER’s THEOREM]. Prove that if a, > 0 fork € Nand )°?°9 a, is 
Cesaro summable to L, then borer a, = L. 


Suppose that a, > 0 fork large and that }°7°, ax/k converges. Prove that 


CO 

; ak 

lim ) = 0. 
jae +k 


jr~w 
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6.1.12. If \-¢_, kax = (n + 1)/(n + 2) for n EN, prove that 


~ 
ll 
pes 


6.2 SERIES WITH NONNEGATIVE TERMS 


Although we obtained exact values in the preceding section for telescopic series 
and geometric series, finding exact values of a given series is frequently difficult, 
if not impossible. Fortunately, for many applications it is not as important to 
be able to find the value of a series as it is to know that the series converges. 
When it does converge, we can use its partial sums to approximate its value as 
accurately as we wish (up to the limitations of whatever computing device we 
are using). Therefore, much of this chapter is devoted to establishing tests which 
can be used to decide whether a given series converges or whether it diverges. 

The partial sums of a divergent series may be bounded [like }°7° ,(—1)*] or 
unbounded [like }°7°., 1/k]. When the terms of a divergent series are nonnega- 
tive, the former cannot happen. 


6.11 Theorem. Suppose that a, > 0 for large k. Then )~7~, a,x converges if and 
only if its sequence of partial sums {sn} is bounded; that is, if and only if there 
exists a finite number M > 0 such that 


n 


> 


k=1 


<M forallneN. 


Proof. Set sy = Y-~_, u% forn EN. If )°7°., ax converges, then s, converges as 
n —> oo. Since every convergent sequence is bounded (Theorem 2.8), )77> ; ax 
has bounded partial sums. 

Conversely, suppose that |s,| < M forn e€ N. Recall from Section 2.1 
that a, > 0 for large k means that there is an N € N such that q > 0 for 
k > N. It follows that s, is an increasing sequence when n > N. Hence by the 
Monotone Convergence Theorem (Theorem 2.19), s, converges. a 


If a, > O for large k, we shall write oy ay < oo when the series is convergent 
and °° | az = co when the series is divergent. 

In some cases, integration can be used to test convergence of a series. The 
idea behind this test is that 


[e,2) 


oo 00 k+1 
[ teoa=S[ peoar~ rw 
k=1 


k=1 


when f is almost constant on each interval [k, k + 1]. This will surely be the case 
for large k if f(k) | 0 as k > o (see Figure 6.1). This observation leads us to 
the following result. 
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#Y 


1 2 3 4 Dee -5 


FIGURE 6.1 


6.12 Theorem. [INTEGRAL TEST]. 

Suppose that f : [1,c00) — R is positive and decreasing on [1, 0). Then 
ye £(k) converges if and only if f is improperly integrable on (1, oo); that 
is, if and only if 


[ feoas < OO. 
1 


Proof. Let 5, = > j_, f(k) and t, = fy’ f(x)dx for n € N. Since f is 
decreasing, f is locally integrable on [1, 00) (see Exercise 5.1.8) and f(k+1) < 
f(x) < f(® for all x € [k,k + 1]. Hence, by the Comparison Theorem for 
Integrals, 


k+1 


fk+)s f(x)dx < fk) 
k 


for k € N. Summing over k = 1,...,n — 1, we obtain 
n n n—1 
mf) = £0) 5 fo fayds= ms Y £0) = 5 — Fon) 
k=2 k=1 


for alln > N. In particular, 


ra) < fh) [ Sie e270) bia EN (3) 
k=l ! 
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By (3) it is clear that {s,} is bounded if and only if {f,} is. Since f(x) => 0 
implies that both s, and ¢, are increasing sequences, it follows from the Mono- 
tone Convergence Theorem that s, converges if and only if ¢, converges, as 
n—> O. a 


This test works best on series for which the integral of f can be easily com- 
puted or estimated. For example, to find out whether )77°,, 1/(1 +k?) converges 
or diverges, let f(x) = 1/(+ x”) and observe that f is positive on [1, co). Since 
f'(x) = —2x/(1 + x”)? is negative on [1, 00), it is also clear that f is decreasing. 
Since 


oo dx be. 9E 
= arctan x F = — —arctan(1) < o~, 
1 T+ x2 2 


it follows from the Integral Test that )°7°, 1/(1 +k”) converges. 
The Integral Test is most widely used in the following special case. 


6.13 Corollary. [p- SERIES TEST]. The series 
5 (4) 


converges if and only if p > 1. 


Proof. If p = 1 or p < 0, the series diverges. If p > 0 and p £ 1, set 
f (x) = x7? and observe that f’(x) = —px~?—! < 0 for all x € [1, oo). Hence, 
f is nonnegative and decreasing on [1, oo). Since 


© dx ee ae ee ae | 
—= lim = lim ————— 
1 xP n>oo ] — p n>oo J] — Dp 


has a finite limit if and only if 1 — p < 0, it follows from the Integral Test that 
(4) converges if and only if p > 1. a 


The Integral Test, which requires f to satisfy some very restrictive hypothe- 
ses, has limited applications. The following test can be used in a much broader 
context. 


6.14 Theorem. [COMPARISON TEST]. 
Suppose that 0 < ag < bx for large k. 


i) Tf rr be < 00, then Y °°, ag < 00. 


ii) If po, ak = 00, then ry~, be = 0. 
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Proof. By hypothesis, choose N € N so large that 0 < a, < by fork > N. 
Set sy = )op_1 a and t, = D-7_) be, n EN. Then 0 < s, —sy < t) — ty for 
alln > N. Since N is fixed, it follows that s,, is bounded when 1, is, and t, is 
unbounded when s, is. Apply Theorem 6.11 and the proof of the theorem is 
complete. rs) 


The Comparison Test is used to compare one series with another whose con- 
vergence property is already known (e.g., a p-series or a geometric series). 
Frequently, the inequalities | sinx| < |x| for all x € R (see Appendix B) and 
|logx| < x® for each aw > 0 provided x is sufficiently large (see Exercise 4.4.6) 
are helpful in this regard. Although there is no simple algorithm for this process, 
the idea is to examine the terms of the given series, ignoring the superfluous fac- 
tors, and dominating the more complicated factors by simpler ones. Here is a 
typical example. 


6.15 EXAMPLE. 


Determine whether the series 


[o.e) 

3k logk 
sa (5) 
ae +k k 


converges or diverges. 


Solution. The kth term of this series can be written by using three factors: 


1 3k /logk 


The factor 3k/(k + 1) is dominated by 3. Since logk < Vk for large k, the factor 


J/logk/k satisfies 
jlogk _ Vk 1 
k “Vk 


for large k. Therefore, the terms of (5) are dominated by 3/k°/4. Since 
> 21 3/k°4 converges by the p-Series Test, it follows from the Comparison 
Test that (5) converges. a 


The Comparison Test may not be easy to apply to a given series, even when 
we know which series it should be compared with, because the process of com- 
parison often involves use of delicate inequalities. For situations like this, the 
following test is usually more efficient. 
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6.16 Theorem. [LIMIT COMPARISON TEST]. 
Suppose that a, > 0, that by > 0 for large k, and that L := Vimy dn /bn exists 
as an extended real number. 


i) If0 < L < ©, then )-7°., ag converges if and only if \°°-., by converges. 
ii) If L =Oand ~~, be converges, then \~y~; ax converges. 
ili) If L = coand y~~, by diverges, then )-°- ; ax diverges. 


Proof. i) If L is finite and nonzero, then there is an N € N such that 


L, abs 
—by < ay < — 
5 k k 5) k 


fork > N. Hence, part i) follows immediately from the Comparison Test 
and Theorem 6.10. Similar arguments establish parts ii) and iti)—see Exer- 
cise 6.2.6. a 


In general, the Limit Comparison Test is used to replace a series )°7°2, ax by 
pas by when ag © Chy for k large and some absolute fixed constant C. For 
example, to determine whether or not the series 


8) eg +k? + 5k 


converges, notice that its terms are approximately 1/(2k) for k large. This leads 
us to compare S with the harmonic series )°7° , 1/k. Since the harmonic series 
diverges and since 


k/(V4k* + k2 + 5k) k? 1 4 
= ze > 
I/k V4k4 +k? 45k 2 


as k — oo, it follows from the Limit Comparison Test that S diverges. 
Here is another application of the Limit Comparison Test. 


6.17 EXAMPLE. 
Let a — Oask — oo. Prove that }°7°2, sin|ax| converges if and only if 
> 21 lax] converges. 


Proof. By \’Hopital’s Rule, 


sin lax| _ sinx 
lim —— = lim —=l1. 
k>o0 lal x>0+ X 


Hence, by the Limit Comparison Test, )°?°, sin |a,| converges if and only if 
> 21 lax| converges. . 
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EXERCISES 


6.2.0. Let {a,} and {b;,} be real sequences. Decide which of the following state- 
ments are true and which are false. Prove the true ones and give coun- 
terexamples to the false ones. 

a) If °°, a, converges and ax/by > O ask — ov, then 72, de 
converges. 

b) Suppose that 0 < a < 1. Ifa, > Oand ¥/ay < a for all k € N, then 
yee ak converges. 

c) Suppose that a, > Oask — oo. Ifa, => Oand /agy1 < ax for all 
k EN, then °°, ax converges. 

d) Suppose that a, = f(k) for some continuous function f : [1, co) > 
[0, co) which satisfies f(x) > 0 asx > oo. If papal ax converges, 
then /;° f(x)dx converges. 


6.2.1. Prove that each of the following series converges. 
oo Ihe 
a). Oe sagan ga ase 
k3 —4k +5 


= 
IL 
') 


oO 
— 
iM2 
w= 


_ 
ie) 
jefe} 


2 
iM2 
Pas 
as} 
Vv 
Vv 
i 


6.2.2. Prove that each of the following series diverges. 


00 3k3 4 D2k2 +k 


a) 2 ee 


co VE 
Db) 
ka k 
oo /k4+2\* 
Cc ——————— 
x & F i) 
a) 1 
d 
) 2, Elog? ps 
6.2.3. If a, => 0 is a bounded sequence, prove that 
CO 
ae 
os (k + 1)? 


converges for all p > 1. 
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6.2.4. Find all p > 0 such that the following series converges: 


Co 


1 
are 
ia) K log (k +1) 


6.2.5. If )-r-, |ax| converges, prove that 


Ce 
laxl 
Sg 
converges for all p > 0. What happens if p < 0? 
6.2.6. Prove Theorem 6.16ii and iii. 
6.2.7. Suppose that a, and by are nonnegative for all k € N. Prove that if 
vey ax and °°, by converge, then )°°°, ab; also converges. 
6.2.8. Suppose that a, b € R satisfy b/a € R\Z. Find all g > 0 such that 


converges. 
6.2.9. Suppose that a, > 0. Prove that )°7°, a, converges if and only if the 
series poet (a2~ + d2x41) converges. 
6.2.10. Find all p € R such that 


[e,2) 


1 
B (log (log k))? eek 


k=2 


converges. 


6.3 ABSOLUTE CONVERGENCE 


In this section we investigate what happens to a convergent series when its terms 
are replaced by their absolute values. We begin with some terminology. 


6.18 Definition. 
Let S = }°72, ax be an infinite series. 


i) S is said to converge absolutely if and only if )°7°., |ax| < 00. 
ii) S is said to converge conditionally if and only if S converges but not abso- 
lutely. 


The Cauchy Criterion gives us the following test for absolute convergence. 
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6.19 Remark. A series )-7-., ax converges absolutely if and only if for every 
€ > 0 there isan N €N such that 


m 
m>n->WN_ implies yy lax | < €. (6) 
k=n 


As was the case for improper integrals, absolute convergence is stronger than 
convergence. 


6.20 Remark. [f 5°72, ax converges absolutely, then )-~°., ax converges, but 
not conversely. In particular, there exist conditionally convergent series. 


Proof. Suppose that )°7°, ax converges absolutely. Given « > 0, choose 
N €Nso that (6) holds. Then 


m 


a 


k=n 


m 


< lal <e 
k=n 


form >n > N. Hence, by the Cauchy Criterion, )°7- , ax converges. 

We shall finish the proof by showing that S := )-72, (-l*/k converges 
conditionally. Since the harmonic series diverges, S does not converge abso- 
lutely. On the other hand, the tails of S look like 


ie.2) 


(—1)/ ry 1 1 1 
yo —— =(-) + ree 
a ko KL kD? RAB 


By grouping pairs of terms together, it is easy to see that the sum inside the 
parentheses is greater than 0 but less than 1/k; that is, 


[e,2) 


Ss oy 2 . 


jak J 


Hence °°, (—1)*/k converges by Corollary 6.9. a 


We shall see below that it is important to be able to identify absolutely con- 
vergent series. Since every result about series with nonnegative terms can be 
applied to the series )°7°, |ax|, we already have three tests for absolute con- 
vergence (the Integral Test, the Comparison Test, and the Limit Comparison 
Test). We now develop two additional tests for absolute convergence which are 
arguably the most practical tests presented in this chapter. 

Before we state these tests, we need to introduce another concept. (If you 
covered Section 2.5, you may proceed directly to Theorem 6.23.) 
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6.21 Definition. 


The limit supremum of a sequence of real numbers {xx} is defined to be 


lim sup x, := lim (sup). 
now 


k>oo k>n 


NOTE: Unlike the limit, the limit supremum of a sequence always exists as 
an extended real number. Indeed, let s, := sup,., xx. If s) = o for all n, 
then s, — co asn — ov, So, by definition, the limit supremum of x, is co. On 
the other hand, if s, is finite for some n, the Monotone Property for Suprema 
implies that the sequence s, is decreasing. Hence, by the Monotone Conver- 
gence Theorem, limy—+oo 5, exists. (It might be —oo, e.g., when x, = —k.) 

In practice, the limit supremum of a sequence is usually easy to find by 
inspection. For example, since (—1)* is 1 when k is even and —1 when k is odd, it 
is clear that sup;..,(—L)* = | for alln € N. Hence the limit supremum of (=1)" 
is 1. Similarly, 

k k 
lim sup(3 + ang: =4 and limsup ola tee = 


k>oo k->0o k 


The only thing we need to know about limits supremum (for now) is the 
following result. 


6.22 Remark. Let x € R and {xx} be a real sequence. 


i) Tf lim supy_, 55 Xk < x, then x, < x for large k. 
ii) Jf lim sup;_, .. X% > xX, then x, > x for infinitely many k’s. 
iti) If xp > x ask — oo, then limsup,;_,.o Xe = x. 


Proof. Let s := limsup,_,,,x, and s, := sup,., x, and recall by Defini- 
tion 6.21 that s, > sasn > oo. 

i) If s < x, then there is an N € N such that sy < x. In particular, x, < x 
for allk > N. 

ii) Ifs > x, then s, > x for all n (because s, is decreasing). Since s1 > x, 
there is ak, > 1 such that x;, > x. Suppose that k; has been chosen so that 
Xkj > xX. Since Sky > Xs there is akj4; > k; such that Xkja, > X- In particular, 
there is an increasing sequence of positive integers k; such that x,, > x for all 
J EN. It follows that x, > x for infinitely many k’s. 

iti) If x, converges to x, given e > 0 there is an N € N such thatk > N 
implies |x, — x| < ¢. In particular, for anyn > N, x. >x—efork >n. 

Taking the supremum of this last inequality over k > n, we see that s, > 
x —é€forn > N. Hence, the limit of the s,’s satisfies s > x —e. Thuss > x. A 
similar argument proves that s < x. a 


The limit supremum gives a very useful and efficient test for absolute 
convergence. 
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6.23 Theorem. [ROOT TEST]. 


Let a, € Rand r := limsup,_,,, lag |1/*. 


i) Ifr <1, then \\~°., ax converges absolutely. 


ii) [fr > 1, then °°, ax diverges. 


Proof. i) Suppose that r < 1. Letr < x < 1 and notice that the geometric 
series )-7°., x* converges. By Remark 6.22i (or by Exercise 2.5.3), 


lap ex 


for large k. Hence, |ax| < x* for large k and it follows from the Comparison 
Test that 5°72, |ax| converges. 
ii) Suppose that r > 1. By Remark 6.22ii (or by Exercise 2.5.3), 
ja > 1 
for infinitely many k €¢ N. Hence, |ax| > 1 for infinitely many & and it follows 
from the Divergence Test that }°7° , ax diverges. ia 


Note by Remark 6.22iii or Theorem 2.36 that if r := limo Jax.|!/ K exists, 
then (by the Root Test) }°7° , ag converges absolutely when r < 1 and diverges 
when r > 1. 

The following test is weaker than the Root Test (see Exercise 6.3.8) but is 
easier to use when the terms of }°7°, a, are made up of products (e.g., of 
factorials). 


6.24 Theorem. [RATIO TEST]. 
Let ax € R with a, 4 0 for large k and suppose that 


lake 
r= lim 
k>oo |ak| 


exists as an extended real number. 

i) Ifr <1, then °°, ay converges absolutely. 

ii) [fr > 1, then °°, ag diverges. 

Proof. \fr > 1, then |ag41| = |ax| for k large and thus a, cannot converge to 


zero. Hence, by the Divergence Test, )°7° , ax diverges. 
Ifr < 1, then observe for any x € (r, 1) that 


lax+1\ x 
< k= 
lag | Xx 
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for k large. Hence, the sequence |a,|/x* is decreasing for large k and thus 
bounded. In particular, there is an M > 0 such that |a,| < Mx* for allk EN. 
Since x < 1, it follows from the Comparison Test that 5°72, |ax| converges. 


6.25 Remark. The Root and Ratio Tests are inconclusive when r = 1. 


For example, apply the Root and Ratio Tests to )7?2, 1/k and °°, 1/k?. In 
all four cases, r = 1. Nevertheless, the first series diverges whereas the second 
converges absolutely. 

How should we proceed when the Root and Ratio Tests are inconclusive (e.g., 
when r = 1)? We can always try to use one of the Comparison Tests. Since this 
can be technically daunting, there are other ways to cope with the case r = 1. 
If the ratios of terms of a series converge to 1 rapidly enough, then the series 
converges. (For three tests of this type, see the results and exercises of Sec- 
tion 6.6.) If the terms of a series have k! as a factor, then there is a very useful 
asymptotic estimate of k! (called Stirling’s Formula—see Theorem 12.73) that 
can be used in conjunction with the Comparison Test (e.g., see Exercises 6.3.3f 
and 6.6.2c). 


It is natural to assume that the usual laws of algebra hold for infinite series 
(e.g., associativity and commutativity). Is this assumption warranted? We have 
“inserted parentheses” (i.e., grouped terms together) to aid evaluation of some 
series [e.g., to evaluate some telescopic series and to prove that )-?°2.,(—1)‘/k 
converges conditionally]. This is valid for convergent series (absolutely or con- 
ditionally) because if the sequence of partial sums s, converges to s, then any 
subsequence s,, also converges to s. The situation is more complicated when 
we start changing the order of the terms (compare Theorem 6.27 with Theo- 
rem 6.29). To describe what happens, we introduce the following terminology. 


6.26 Definition. 


A series )°72, 5; is called a rearrangement of a series }°7°; ax if and only if 
there is a 1-1 function f from N onto N such that 


brik =a, kKeEN. 


The following result demonstrates why absolutely convergent series are so 
important. 


6.27 Theorem. [f 5°72, ax converges absolutely and ie , bj is any rearrange- 
ment of > p=) ak, then Y77°, bj converges and 


CO CO 
Sax = by 
k=1 j=l 
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Proof. Let ¢ > 0. Set sn = op) ak, S= po ax, and tn = yy bj,n,m EN. 
Since }°7° ; ax converges absolutely, we can choose N € N (see Corollary 6.9) 
such that 


[e,2) 


E 
ye lal <5: (9) 
k=N+1 
Thus 
CO CO € 
lsy —s| = a ak) < >. lag| < ri (10) 
k=N+1 k=N+1 


Let f be a 1-1 function from N onto N which satisfies 
brik) = 4k; keN 
and set M = max{f(1),..., f(N)}. Notice that 
{a,,...,an} C {hy,..., by}. 


Let m > M. Then ft, — sy contains only ax’s whose indices satisfy k > N. 
Thus, it follows from (9) that 


= é 
ltm—swl< Do lal <5. 


Hence, by (10), 


gE E€ 
Mam ae ea a Rae ee a 


for m > M. Therefore, 
CO 
ee | 
j=l 


The rest of this section, which is used nowhere else in this book, is optional. 

We now show that Theorem 6.27 fails in a catastrophic way for conditionally 
convergent series (see Theorem 6.29 below). To facilitate our discussion, recall 
(see Exercise 1.2.3) that the positive and negative parts of ana € Rare defined by 


ata lal te _ a a>0 
Te BNO) «ae 


and 


| 
a 
S 
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Notice that 
a* > 0, 
and 
a=at— a, 
foralla ER. 
*6.28 Lemma. 


Suppose that a, € R fork EN. 


(7) 


(8) 


i) If 7) ax converges absolutely, then so do \°~°., af and YY, a, . In fact, 


CO CO CO [o,2) CO CO 
Slal= art Se and Ya = dat — Par, 
k=1 k=1 k=1 k= k=1 


k=1 


ii) If \°724 ax converges conditionally, then 


Proof. By definition, at = (\ax| + ax) /2. Since both 77° , |ax| and S772, ax 
converge, it follows from Theorem 6.10 that 


Ce A 1 Ce 1 Co 
oat = 5 ula + 5) ae 
k=1 k=1 k=1 


converges. Similarly, 


converges. This proves part i). 


a i 
yee = Dae - 2th 


Suppose that part ii) is false. By symmetry we may suppose that )°7° ; at 


converges. Since }°7°, ax converges, it follows from (8) that 


converges. Thus, 


ioe) (oe) ioe) 
Dilael = Dae + Day 
k=1 


k=1 


converges, a contradiction. 


k=1 
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We are prepared to show that Theorem 6.27 is false if the hypothesis “abso- 
lutely convergent” is dropped. In fact, as the following result shows, rearrange- 
ments of conditionally convergent series can converge to anything one wishes 
(see also Exercise 6.3.10). 


*6.29 Theorem. [RIEMANN]. 
Letx €R. If 2-2, ax is conditionally convergent, then there is a rearrangement 
of Yor, ag which converges to x. 


SrrateGy: The idea behind the proof is simple. Since @2,a~i = Ry 
a, = 00 by Lemma 6.28, begin by adding enough at’s until the resulting par- 
tial sum is > x. Then subtract enough a, ’s until the resulting partial sum is < x, 
and continue adding and subtracting. Since a, — 0 as k — ov, the resulting 
partial sums should be getting closer to x. We now make this precise. 


Proof. Since )°¢°., a = 00, let k; be the smallest integer which satisfies a + 
a; +: . ay, > x. Since kj is least, aj +- . tay <x, soa} +a; +- . ay, < 


x + G;. Set ro = 0 and observe that sx, 41) := ay +... eG ky SX +a. 


Suppose for some j > | that integers 79 < ry < --- < rj-; andky < 
ky < -+- < k; have been chosen such that a partial sum Skj> of aj’’s and a, ’s, 
satisfies 

+ 
Skj+rj—1 <x + a, (11) 
Since 2, a, = ©, let rj > rj—1 be the smallest integer which satisfies 
eet hit =e a, < x. Forkj +rj-1 <n < kj +1rj, set sy c= Sk; — 

! A he J d 
Bp gto Oe Ogg It is easy to see that Skj+rj1 2 Skjtrjatl 2+ 2S Skjtr;- 


Since r; is least, we also have Skj+rj = Skjtrj—1 — a, >x- a, . It follows from 
(11) that 


sn —x| < max{a,,,.a;} for kj +rj1<n<kj+r;. (12) 
Similarly, if we let k;4 be the smallest integer which satisfies 


43 + 
Skjtrj Tp. 4, Po Tay > 


: + + 
and set sy, := Skj+r; + ag 44 +.-- + yy; fork; +rj <n <kj41+7;, then 
— + 
lsp) —x| < max{a,,;, a, for kj +rj<n<kjaitr;. (13) 


Let ¢ > 0. Since each at and —a, is either a, or 0, it is clear (after deleting 
the zero terms) that the s,,’s are partial sums of a rearrangement of )°7°, ag. 
Moreover, since a, — 0 as k — oo, we can choose an N so large that j > N 
implies that a, and a, are both less than ¢. We conclude by (12) and (13) 


that ifn > ky +ry, then |s, — x| < e; that is, that s, > x asn > oo. | 
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EXERCISES 


6.3.0. 


6.3.1. 


6.3.2. 


Let {ax} and {by} be real sequences. Decide which of the following state- 
ments are true and which are false. Prove the true ones and give coun- 
terexamples to the false ones. 


a) Suppose that 0 < a < oo. If \a,|“/* + ao, where ay < 1, then 
Ye af is absolutely convergent. 
b) If 072) ax is absolutely convergent and a, | 0 as k > oo, then 


1/k 


lim sup |ag|"/" < 1. 


k->co 


c) If ag < by for allk € N and ya , D« is absolutely convergent, then 
ye ak converges. 

d) If )°¢2, ax is absolutely convergent, then )-7° | az is absolutely con- 
vergent. 


Prove that each of the following series converges. 


Decide, using results covered so far in this chapter, which of the follow- 
ing series converge and which diverge. 


) by 2 
a 
fest flogk 


lee) ae 


b) Xu 
E(B wai) 
- + 4k 
2-4--- (2k) 
arene ae \h 


G=11\* 
(Ss) 


14s(=1)* ‘ 
3 


6=cp)" 
3k 


Rex 
Me f 


& 
Me 


~ 
ll 
a 


<a) 
Me 


~ 
ll 
ul 


sed 
Me 


= 
ll 
_ 
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6.3.3. For each of the following, find all values of p € R for which the given 
series converges absolutely. 


a) > ee 


" » es 
e) Lert —k?P) 


oo he 


1) De 
k=1 


6.3.4. Suppose that a, > 0 and that a,! * _, aask > oo. Prove that eer anx* 


converges absolutely for all |x| < 1/a ifa #0 and for allx ¢ Rifa =0. 
6.3.5. Define a, recursively by a; = | and 


1 —l 
ak = (=1)* (1 —kcos (z)) ak—1, k> 1. 


Prove that )°7°., a; converges absolutely. 
6.3.6. Suppose that a,j => 0 fork, j € N. Set 


lee) 
Ak => Sak 


j=l 


for each k € N, and suppose that )°7°, A; converges. 


a) Prove that 


b) Show that 
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6.3.7. 


6.3.8. 


6.3.9. 


*6.3.10. 


6.3.11. 
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c) Prove that b) may not hold if a; has both positive and negative 
values. 
Hint: Consider 


1 j=k 
aKj = —1 J=kt+i1 
0 otherwise. 


a) Suppose that 5°72, a; converges absolutely. Prove that )°?-; |ax|? 
converges for all p > 1. 

b) Suppose that )°?° , az converges conditionally. Prove that }°7°., k? ax 
diverges for all p > 1. 


For any real sequence {x,;}, define 


liminf x, := lim (it ss) : 
k>oo noo \k>n 
a) Prove that if lim infy...x, > x for some x € R, then x, > x for 
k large. 
b) Prove that if x, > x ask — oo, for some x € R, then lim infg_, 
Xp =X. 
c) Ifa, > 0 for allk € N, prove that 


ak a 
lim inf < lim inf ax < lim sup ax < lim sup —— = 
OO Ak k> k>00 | =k 


d) Prove that if b, € R \ {0} and |by41/bn| > r asn — ov, for some 
r > 0, then |b,|!/" > rasn > oo. 


Given that )°?°,1/k? = 27/6 (see Exercise 14.3.7), find the exact 
value of 


S 1 
k=) 


Let x < y be any pair of extended real numbers. Prove that if )°7° ; ax 
is conditionally convergent, then there is a rearrangement ae b; of 


> 1 ae Whose partial sums s,, satisfy 


liminfs, =x and limsups, = y. 
We n->0o 


a) Using Exercise 4.4.4, prove that 
oo ky 2k+1 
ts ae IER 
sinx = Sera TD! 


k=0 


for all x € [0, 2/2]. 


6.4 
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b) Prove that 


oo (= 1)kx2k 
(Qk)! 


COs x = 
k=0 


for x € [0, 2/2]. 


ALTERNATING SERIES 


We have identified many tests for absolute convergence but have said little 
about conditionally convergent series. In this section we derive two tests to 
use on series whose terms have mixed signs. 

Both tests rely on the following algebraic observation. (This result will also 
be used in Chapter 7 to prove that limits of power series are continuous.) 


6.30 Theorem. [ABEL’S FORMULA]. 
Let {ax}ken and {bx}een be real sequences, and for each pair of integers n > 


m > 1 set 
n 
Anwm = ye ak. 
k=m 
Then 
n n—-1 
Ds abe = An,mbn = ye Ak.m(bk41 = bx) 
k=m k=m 


for all integersn > m > 1. 


Proof. Since Axm — Ak—1),m = ax for k > mand Am m = am, we have 


n n 
x abe = Anbm + ye (Akm — A(k—1),m) Dx 


k=m k=m+1 
n n—-1 
= Ambm + So Ax, mbx a ye Ag,mbK41 
k=m+1 k=m 
n—1 n—-1 
= Anbm + ye Ak mbx + An,mbn _ > Ak mbk+1 = Am,mbm+1 
k=m+1 k=m+1 
n—-1 
= An,mbn — Am,m(Om4+1 eS ys Ak,m(bk+1 — bk) 
k=m+1 
n—-1 
= Anmbn — Y> Akm(be+1 — be). 7 


k=m 
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This result is somewhat easier to remember using the following analogy. If 
f : U1, N] ~ R for some N €N, then the summation ie f(&) is an approxi- 
mation to J fa f(x)dx and the finite difference f (k + 1) — f(k) is an approxima- 
tion to f’(k) fork = 1,2,..., N — 1. In particular, summation is an analogue of 
integration and finite difference is an analogue of differentiation. In this con- 
text, Abel’s Formula can be interpreted as a discrete analogue of integration 
by parts. 

Our first application of Abel’s Formula is the following test. (Notice that it 
does not require that the a;’s be nonnegative.) 


6.31 Theorem. [DIRICHLET’S TEST]. 
Let ax, by € R fork € N. If the sequence of partial sums sy = = ag 1s 
bounded and b,; | 0 as k — oo, then baa aby converges. 


Proof. Choose M > 0 such that 


” M 
= << : 
ISn| 2 => neN 
By the triangle inequality, 
2D M M 
|Anm| = a =e tS et aM 


forn>m>1. 
Let « > 0 and choose N € N so that |by| < e/M fork > N. Since {bz} is 
decreasing and nonnegative, we find, by Abel’s Formula, the choice of M, and 


by telescoping that 
n n—-1 
Yo aebe| < |Anml lBnl + 95 |g ml (be = be-41) 
k=m k=m 
< Mb, + M(bym — bn) = Mby, < € 
foraln>m>N. |_| 


The following special case of Dirichlet’s Test is widely used. 


6.32 Corollary. [ALTERNATING SERIES TEST]. If az | 0 as k — o0, then 


YD ia 
k=1 


converges. 
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Proof. Since the partial sums of }°7°., (—1)* are bounded, 4 (—1)* ax con- 
verges by Dirichlet’s Test. a 


We note that the series )°?° (—1)* /k, used in Remark 6.20, is an alternating 
series. Here is another example. 
6.33 EXAMPLE. 
Prove that )-?°,, (—1)*/logk converges. 


Proof. Since 1/logk | 0 ask — om, this follows immediately from the Alter- 
nating Series Test. | 


The Dirichlet Test can be used for more than just alternating series. 


*6.34 EXAMPLE. 
Prove that S(x) = 772, sin(kx)/k converges for each x € R. 


Proof. Since $(x) = sin(kx) is periodic of period 27 [i.e., d(x + 27) = o(x) 
for all x € R] and has value identically zero when x = 0 or 27, we need only 
show that S(x) converges for each x € (0, 27). By Dirichlet’s Test, it suffices 
to show that 


n 
Dy (x):= )- sin(kx), neN (14) 
k=l 
is a bounded sequence for each fixed x € (0, 277). 
This proof, originally discovered by Dirichlet, involves a clever trick which 
leads to a formula for D,. Indeed, applying a sum angle formula (see 
Appendix B) and telescoping, we have 


2sin (5 ) By (x) = Len ) sin(kx) 


Therefore, 


for alln EN. |_| 
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REVIEW. We have introduced more than a dozen tests for conver- 
gence/divergence. With such a wealth of options, students can sometimes be 
overwhelmed. Here is one suggestion for an order in which to apply these tests 
to aseries S:= S°7°, ag. 


i) Try (but not too hard) to find L = limg_... a. If L # 0 or L doesn’t exist, 
S diverges by the Divergence Test. If L = 0 or L is too hard to find, continue. 

ii) If a, is geometric or a p-series, use Theorem 6.7 or Corollary 6.13 to deter- 
mine convergence properties of )°7°., ax. If ax looks a lot like some geo- 
metric or a p-series by, use the Limit Comparison Test, replace ax by by, and 
apply 6.7 and 6.13 to bx. 

iii) Try to find 
r= lim art] or r = limsup uel; 
k->oo |ag| k>0o 
Ifr < 1, then S converges absolutely. Ifr > 1, then S diverges. Ifr = 1 or 
these limits are too hard to evaluate, continue. 

iv) Ifthe series “alternates” [has factors that oscillate between positive and neg- 
ative values, i.e. (—1)*, sink or cos(2k+ 1)], try to use the Alternating Series, 
the Dirichlet, or Abel’s Test (see Exercise 6.4.4). 

v) If |ax| ~ by, where by is some nonnegative sequence such that the con- 
vergence property of )°7~., bg is known, try the Limit Comparison Test or 
the Comparison Test. If |a,| is “integrable with respect to k,” try the Inte- 
gral Test. 


As long as you don’t spend too much time on any one step, you should con- 
verge (no pun intended) to an answer fairly quickly. If you get to the end of the 
process and still haven’t arrived at a conclusion, repeat the steps again, trying a 
little harder this time. Most series, especially the ones that come up in practice, 
will succumb to this process sooner rather than later. 


EXERCISES 


6.4.0. Let {az} and {bx} be real sequences. Decide which of the following state- 
ments are true and which are false. Prove the true ones and give coun- 
terexamples to the false ones. 


a) If a | 0, ask — o, and ei by converges conditionally, then 
ye ache converges. 

b) Ifa, > 0, ask > oo, then re Dag converges. 

c) If a, > 0, ask > oo, and a > 0 for all k € N, then 1, (-D*ax 
converges. 

d) If a —> 0, ask — oo, and Ye Dkax converges, then a; | 0 as 
k> ow. 


6.4.1. Prove that each of the following series converges. 
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00 k 
b) yo uy xeR, p>0 


ee 


co 44+ 2sin(1/k) 


e) Ze 1)" (F + aretan ) 


6.4.2. For each of the following, find all values x € R for which the given series 


converges. 
[oe) k 
a) — 
k=1 
ioe) 5k 
b) 3 
k=1 
> (—x)* 
Cc ——————— re 
k=l Vk* + 4k +1 
(2x — 3)* 
d 
) = k Jak +1 


6.4.3. Using any test covered in this chapter so far, find out which of the follow- 
ing series converge absolutely, which converge conditionally, and which 
diverge. 

co (—1)* 2K4 

a Do 
ee (k+DI a 2)! 
(= ~ = 4)... (—2k) 
b) > 

k=1 “(5k = 1) 
i cp. ere 
pet! (k+1)! ; 

y= (—D*! VE 1 

kel k 

ge ao) vk 


p> 


6.4.4. [ABEL’s Test] Suppose that }°7°., a; converges and that by | bask —> oo. 
Prove that Sei a,b, converges. 
6.4.5. Show that under the hypotheses of Dirichlet’s Test, 


Yaa = Se — bk+1). 


k=1 
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6.4.6. Suppose that {a,} and {bx} are real sequences such that a, — 0 as 


k>o, 
[o,2) n 
SS lak+1 — ae <oo, and | So & |< _M neN. 
k=1 k=1 


Prove that ye | abe converges. 
6.4.7. Suppose that }77°, ax converges. Prove that if bk ¢ co and S°°° , aby 
converges, then 


CO 
bn » ar > 0 
k=m 
asm —> o. 
*6.4.8. Prove that 


CO 
Soak cos(kx) 
k= 


converges for every x € (0,27) and every a; | 0. What happens when 
x =0? 
*6.4.9. Suppose that a, | 0 ask — oo. Prove that 


Soak sin((2k + 1)x) 


k=1 


converges for all x € R. 


*6.5 ESTIMATION OF SERIES 


In practice, one estimates a convergent series by truncation (i.e., by adding 
finitely many terms of the given series). In this section we show how to esti- 
mate the error associated with such a truncation. 

The proofs of several of our earlier tests actually contain estimates of the 
truncation error. Here is what we can get from the Integral Test. 


6.35 Theorem. Suppose that f : [1,c0o) — R is positive and decreasing on 
[1, co). Then 
f@< Fk) -[ f(x)dx < f() forneN. 
k=1 
Moreover, if para , f (k) converges, then 
0< >) + [ fads -¥@ < FO) 
k=1 e k=1 


foralln EN. 
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Proof. The first set of inequalities has already been verified [see (3) in the 


proof of Theorem 6.12]. To establish the second set, let uz = sy — t fork EN, 
and observe, since f is decreasing, that 


k+1 
0<m—ue = | f@jde= fk) =f) f+ 1). 


Summing these inequalities over k > n and telescoping, we have 


OS un — tim uj = ) (ue — uni) = DSH — f+) = fo. 


k=n k=n 


Since uj > 1 f() - re f (x)dx as j — oo, we conclude that 


0< rw+f ford YorW = Fe. g 
k=1 N 


k=l 
The following example shows how to use this result to estimate the accuracy 
of a truncation of a series to which the Integral Test applies. 


6.36 EXAMPLE. 


Prove that °°, ke-® converges and estimate its value to three decimal places. 


Proof. Let f(x) = xe’. Since fix) = er (1 — 2%") = 0 forx > 1, f is 
decreasing on [1, oo). Since 


“S 1% 1 
/ re“ ax => [ e“du=— <a, 
1 2 1 2e 


it follows from the Integral Test that )°?° ; ke“® converges. 

By Theorem 6.35, the error of replacing s by }°7_, f(k) + i Sf (x)dx 
is dominated by f(n). By the rounding process, this estimate to s will be 
accurate to three decimal places if the error f(n) is < 0.0005. Since f(2) = 
0.036631 and f (3) = 0.000370, it follows that we should use n = 3. Since 


3 Co 
bes Dis. 3 1 
Ske +f xe a= at 5 ae 0.4049427, 
e e 
k=1 - 
we conclude that a three-place estimate to s is given by 0.405. A 


The next example shows that Theorem 6.35 can be used to estimate divergent 
series as well. 
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6.37 EXAMPLE. 
Prove that there exist numbers C,, € (0, 1] such that 


n 


1 
Ly c= logn+ Cy 
k=1 


for alln EN. 


Proof. Clearly, f(x) = 1/x is positive, decreasing, and locally integrable on 
[1, co). Hence, by Theorem 6.35, 


> [+4 de = Dj toon 5 a 


Next, we see what the Alternating Series Test has to say about truncation 
error. 


6.38 Theorem. Suppose thata, | Oask > o. Ifs = aaa. and 
Sp = pe (— Dk ag, then 0 < |s — Sn| < an41 foralln EN. 


Proof. Suppose first that n is even, say n = 2m. Then 


O > (—aam+1 + A2m+2) + (—d2m+3 + d2m+4) +--+ 


d) Dia = 5 — sy 


k=2m+1 


= —A2m+1 + (A2m4+2 — 42m+43) + (@2m+4 — A2m45) +-°: 
= —aIm+13 


that is,0 > s —s, > —ay4,1. A similar argument proves that 0 < s — sy < dy41 
when n is odd. | 


This result can be used to estimate the error of a truncation of any alternating 
series. 


6.39 EXAMPLE. 


For each a > 0, prove that the series "7° , (—1)kk/(k? +a) converges. If s, rep- 
resents its nth partial sum and s its value, find an n so large that s, approximates 
s to two decimal places. 


Proof. Let f(x) = x/(x? + a) and note that f(x) > 0 as x —> oo. Since 
f(x) = (a — x7) /(x? + a)? is negative for x > ./|a|, it follows that k/(k>+a) | 
0 as k — oo. Hence, the given series converges by the Alternating Series Test. 


Section 6.5 Estimation of Series 231 


By Theorem 6.38, s, will estimate s to two decimal places if f(n) < 0.005 
(i.e., ifn? —200n +a > 0). When a > 104, this last quadratic has no real roots; 
hence, the inequality is always satisfied and we may choose n = 1. When 
a < 104, the quadratic has roots 100 + 104 — a. Hence, choose any n which 
satisfies n > 100+ V10* —a@. a 


Finally, we examine what information the proofs of the Root and Ratio Tests 
contain about accuracy of truncations. 


6.40 Theorem. Suppose that )°¢°., ax converges absolutely and that s is the 
value of )~?~, |ak\- 


i) If there exist numbers x € (0, 1) and N € N such that 


lax] /" <x 
for allk > N, then 
n n+l 
O<s— < 
s—) lal < i 
k=1 
foralln>N. 
ii) If there exist numbers x € (0, 1) and N €N such that 
lak-+1| Z 
lax| 


fork > N, then 


foralln>N. 


Proof. Let n > N. Since |ax| < x* for k > N, we have, by summing a geomet- 
ric series, that 


n 0° oo n+l 
k Xx 
O0<s-) lax | = ) lax| < y ol ar 
k=1 k=n+1 k=n+1 =< 


for alln > N. This proves part i). The proof of part ii) is left as an exercise. 


6.41 EXAMPLE. 


Prove that )77° , k7*/(3k? +k)* converges absolutely. If s, represents its nth par- 
tial sum and s its value, find an n so large that s, approximates s to an accuracy 
of 10-7. 
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Solution. Since 


2k I/k k2 1 

Se ee 

(sexpr) 3k7 +k ~ 3 
for all k > N := 1, the series converges absolutely by the Root Test. Since 
(1/3)"+!/(1 — 1/3) < 10-? for n > 4, we conclude by Theorem 6.40i that it 
takes at most four terms to approximate the value of this series to an accuracy 
of 10-7. ai 


EXERCISES 


6.5.1. For each of the following series, let s, represent its partial sums and s its 
value. Prove that s is finite and find an n so large that s, approximates s 
to an accuracy of 1077. 


a) 2 ei (aresn 7) 
k=2 k 


oe) (<p! k3 


b) De Ak 

k=1 

CO ( =) 1h 3l (Ok = 1)! 
) Xu 2 DY. dl. - (2k)! 


6.5.2. a) Find all p > 0 such that the following series converges: 


[e,2) 


1 
, ter ERD’ 
tk log (k+ 1) 


b) For each such p, prove that the partial sums of this series s, and its 


value s satisfy 
n+p-1 1 
Is — Sy] < Bi ( =I ) 
n(p—1) \log?~*(n) 


6.5.3. For each of the following series, let s, represent its partial sums, and let 
s represent its value. Prove that s is finite and find an n so large that s, 
approximates s to three decimal places. 


oS 2 


for alln > 2. 


6.5.4. Prove Theorem 6.40i1. 
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*6.6 ADDITIONAL TESTS 


If the Ratio or Root Test yields a value r = 1, then no conclusion can be made. 
There are some tests designed to handle just that situation (see Exercise 6.6.3). 
We cover two of them in this section (see also Exercises 6.6.4 and 6.6.5). 

The first test compares the growth of the terms of a series with the growth of 
the logarithm function. 


6.42 Theorem. [THE LOGARITHMIC TEST]. 
Suppose that a, 4 0 for large k and that 


. log(1/laxl) 
p= im —W— 
k->0oo logk 


exists as an extended real number. If p > 1, then )-7°_, ax converges absolutely. 
If p <1, then \~?~., |ax| diverges. 


Proof. Suppose that p > 1. Fixg € (1, p) andchoose N € Nsothatk > N 
implies log(1/|ag|) > glogk = log(k?). Since the logarithm function is mono- 
tone increasing, it follows that 1/|a,| > k4; that is, that |a,| < k~? fork > N. 
Hence, by the Comparison Test, 5°72. |ax| converges. 

Similarly, if p < 1, then |ax| > 1/k for large k. Hence, by the Comparison 
Test, S772 |ax| diverges. | 


Our final test works by examining how rapidly the ratios of az+1 /az converge 
tor = 1 (see also Exercise 6.6.5 below). Its proof uses Bernoulli’s Inequality. 


*6.43 Theorem. [RAABE’S TEST]. 
Suppose that there is a constant C and a parameter p such that 


ak+1 
ak 


P 
~ kK+C o) 


for large k. If p > 1, then )°7°., ax converges absolutely. 


Proof. Set x, =k +C —1 fork € Nand choose N € N such that x; > 1 and 
(15) hold for k > N. By the p-Series Test and the Limit Comparison Test, 


CO 
Sa ee (16) 
k=N 


By (15) and Bernoulli’s Inequality, 


Pp 
Ak+1 Sei De os (1 1 ) a: 
= Xkt1 Xk+1 Kp 


ak 
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Hence, the sequence {|ax [ep }ecy 1s decreasing and bounded above. In partic- 
ular, there is an M > O such that |ax| < Mx,” fork > N. We conclude by (16) 
that )°?°., ax converges. a 


EXERCISES 


6.6.1. Using any test covered in this chapter, find out which of the follow- 
ing series converge absolutely, which converge conditionally, and which 


diverge. 
00 5-9---(4k +1) 
) aT Gk4) 
fa 4-7---GBkK+-1) 
co 5+ 8-+- (3k + 2) 
b) b 
gay 11-14--- BK +8) 
oe 1 
Cc oe 
) & (log k)3 os loak 
& 2 \vE 
»E(-z) 
ae 


6.6.2. For each of the following, find all values of p € R for which the given 
series converges absolutely, for which it converges conditionally, and for 
which it diverges. 


ee) 

a) >° 2k-HekP-l 
k=1 
ee) 

b) y (log k)3? los* 
k=2 


ee) kK 


*6.6.3. a) Prove that the Root Test applied to the series 


[o,2) 


1 
Ss (log k)log k 


k=2 


yields r = 1. Use the Logarithmic Test to prove that this series 
converges. 
b) Prove that the Ratio Test applied to the series 


13k = 1) 
ss 


£~4.6-+- (2k +2) 


yields r = 1. Use Raabe’s Test to prove that this series converges. 


6.6.4. 


6.6.5. 
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Suppose that f : R > (0, oc) is differentiable, that f(x) — 0as x > ow, 
and that 
xf"(x) 
= lim 
x00 f(x) 
exists. Ifa < —1, prove that aw , f (k) converges. 
Suppose that {a;} is a sequence of nonzero real numbers and that 


= lim k(1- 
p= jim e(r— |S) 


exists as an extended real number. Prove that )°7°., ax converges abso- 
lutely when p > 1. 


a: 


ak+1 
ak 
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Infinite Series of Functions 


7.1 UNIFORM CONVERGENCE OF SEQUENCES 


You are familiar with what it means for a sequence of numbers to converge. In 
this section we examine what it means for a sequence of functions to converge. 
It turns out there are several different ways to define convergence of a sequence 
of functions. We begin with the simplest way. 


7.1 Definition. 


Let E be a nonempty subset of R. A sequence of functions f,, : E — Ris said 
to converge pointwise on E (notation: f, — f pointwise on E as n — oo) if 
and only if f(x) = limp-soo fn(x) exists for each x € E. 


Because { f,,} converges pointwise on a set E if and only if the sequence of real 
numbers { f,,(x)} converges for each x € E, every result about convergence of 
real numbers contains a result about pointwise convergence of functions. Here 
is a typical example. 


7.2 Remark. Let E be a nonempty subset of R. Then a sequence of functions fy 
converges pointwise on E, asn — ox, if and only if for every ¢« > Oand x € E 
there is an N € N (which may depend on x as well as &) such that 


n>N_ implies |fy(x) — f(x)| <«. 


Proof. By Definition 7.1, f, > f pointwise on E if and only if f,(«) > f(x) 
for all x € E. This occurs, by Definition 2.1, if and only if for every « > 0 and 
x € E there isan N € N such that n > N implies | f, (x) — f()| < «. (| 


If f. — f pointwise on [a, b], it is natural to ask, What does f inherit from 


fn? The next four remarks show that, in general, the answer to this question is 
“not much.” 


7.3 Remark. The pointwise limit of continuous (respectively, differentiable) 
functions is not necessarily continuous (respectively, differentiable). 


Proof. Let fn(x) = x" and set 


O<x<l 


x=. 


f= 4h 
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Then f, — f pointwise on [0, 1] (see Example 2.20), each f, is continuous 
and differentiable on [0, 1], but f is neither differentiable nor continuous at 
x=1. a 


7.4 Remark. The pointwise limit of integrable functions is not necessarily inte- 


grable. 
Proof. Set 
1 x = p/m €Q, written in reduced form, where m <n 
frlx) = ; 
0 otherwise, 


forn € Nand 


fil x€EQ 
Ff) = otherwise. 


Then f, — f pointwise on [0,1], each f, is integrable on [0, 1] (with integral 
zero), but f is not integrable on [0, 1] (see Example 5.11). | 


7.5 Remark. There exist differentiable functions f, and f such that fr > f 
pointwise on [0, 1] but 


/ 
lim. fy) # (lim f(x) (1) 
no no 
for x = 1. 
Proof. Let f,(x) = x"/n and set f(x) = 0. Then f,, > f pointwise on [0, 1], 
each f, is differentiable with f/(x) = x”"~!. Thus the left side of (1) is 1 at 
a 


x = | but the right side of (1) is zero. 


7.6 Remark. There exist continuous functions f, and f such that f, — f point- 
wise on [0, 1] but 


1 1 
lim / falx) dx # f (Jim In(x)) dx. (2) 
n—>CoO 0) 0 n—-> CoO 


Proof. Let f\(x) = 1 and, forn > 1, let f,, be a sequence of functions whose 
graphs are triangles with bases 2/n and altitudes n (see Figure 7.1). By the 
point-slope form, formulas for these f,,’s can be given by 


n°x O<x<I/n 
fn(x) = 4 2n —n?x l/n<x<2/n 
0 2/n<x <1. 
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FIGURE 7.1 


Then f; — 0 pointwise on [0, 1] and, since the area of a triangle is one-half 


base times altitude, fis fnu(x) dx = 1 for alln € N. Thus, the left side of (2) is 1 
but the right side is zero. a 


In view of the preceding examples, it is clear that pointwise convergence is 
of limited value for the calculus of limits of sequences. It turns out that the fol- 
lowing concept, discovered independently by Stokes, Cauchy, and Weierstrass 
around 1850, is much more useful in this context. 


7.7 Definition. 


Let E be a nonempty subset of R. A sequence of functions f, : E > Ris said 
to converge uniformly on E to a function f (notation: f, > f uniformly on E 
as n — oo) if and only if for every ¢ > 0 there is an N € N such that 

n>WN implies |f,(x) — f@)| <e 


for allx € E. 


Comparing Definition 7.7 with Remark 7.2 above, we see that the only dif- 
ference between uniform convergence and pointwise convergence is that, for 
uniform convergence, the integer N must be chosen independently of x (see 
Figure 7.2). Notice that this is similar to the difference between uniform conti- 
nuity and continuity (see the discussion following Example 3.36). 

By definition, if f, converges uniformly on E, then f,, converges pointwise 
on E. The following example shows that the converse of this statement is false. 
[This example also shows how to prove that f, — jf uniformly on a set E: 
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~~ y=fa)+e 
y = fn(x) 
y = f(x) 


— _— 


~ 
~~ 7 
wa ~yafa@-eé 


#Y 


\- 
E 
FIGURE 7.2 


dominate | f, (x) — f(x)| by constants b,, independent of x € EF, which converge 
to zero asin — o0.] 


7.8 EXAMPLE. 


Prove that x” —> 0 uniformly on [0, b] for any b < 1, and pointwise, but not 
uniformly, on [0, 1). 


Proof. By Example 2.20, x” — 0 pointwise on [0, 1). Let b < 1. Givene > 0, 
choose N € N such that n > N implies b” < e. Then x € [0,b] andn > N 
imply |x"| < b” < e; that is, x” — 0 uniformly for x € [0, b]. 

Does x” converge to 0 uniformly on [0, 1)? If it does, then given 0 < ¢ < 
1/2, there is an N € N such that |x| < ¢ for all x € [0,1). Butx > 1 as 
x — 1—so we can choose an xo € (0, 1) such that xy > € (see Figure 7.3). 
Thus € < re < €,a contradiction. | 


The next several results show that if f, > f or f/ > f’ uniformly, then f 
inherits much from fy. 


7.9 Theorem. Let E be a nonempty subset of R and suppose that f, —> f uni- 
formly on E, asn — oo. If each f, is continuous at some xo € E, then f is 
continuous at xo € E. 


Proof. Let « > 0 and choose N € N such that 


n>N and x€E imply fu) — FO) <5. 
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Ay 


FIGURE 7.3 


Since fy is continuous at x9 € E, choose 5 > 0 such that 
Ix —xol <8 and xeE imply |fv(x) — fiv(eo)l < 5. 
Suppose that |x — xo| < 6 and that x ¢ E. Then 


If) — f(xo0)| < If) — fu @)| + Lin) — fn (X0)| + | fn (0) — f0) < €. 
Thus f is continuous at x9 € E. a 


(For a generalization of this result, see Exercise 7.1.6. For a converse of this 
result when the sequence /f, is pointwise monotone, see Theorem 9.60.) 

Here is an important theorem about interchanging a limit sign and an integral 
sign (compare with Remark 7.6). 


7.10 Theorem. Suppose that f, — f uniformly on a closed interval [a, b]. If 
each f, is integrable on [a, b], then so is f and 


b b 
tim, f fata) dx = fo (tim. fa(x)) de 


In fact, limy oof) fn(t) dt = f° f(t) dt uniformly for x € [a, b}. 
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Proof. By Exercise 7.1.3, f is bounded on [a, b]. To prove that f is integrable, 
let e > 0 and choose N € N such that 


n>N_ implies |f(x) — fa(x)| < 3 


é€ 
56) (3) 


for all x € [a, b]. Using this inequality for n = N, we see that by the definition 
of upper and lower sums, 


U(f— fn, P)=< 3 and Lf — fr, P)= 3 


€ 
3 


for any partition P of [a, b]. Since fy is integrable, choose a partition P such 
that 


€ 
It follows that 
UL PAH LL PSU = ino Ph). FU Pye Ls 2) LO ine?) 
& € €& 
< 3 + 3 + 3 = 6&; 
that is, f is integrable on [a, b]. We conclude by Theorem 5.22 and (3) that 


[noa- [roa 


for allx € [a, b]}andn > N. a 


~ é(x — a) 
< fim Folds Ba ce 


Here is a Cauchy Criterion for uniform convergence. 
7.11 Lemma. [UNIFORM CAUCHY CRITERION]. 
Let E be anonempty subset of Rand let f, : E — R be asequence of functions. 


Then fy converges uniformly on E if and only if for every ¢ > 0 there is an 
N €N such that 


n,m>N imply |fn(x)— fn(x)| <€ (4) 
forallx € E. 


Proof. Suppose first that f, > f uniformly on E asn — oo. Let ¢ > 0 and 
choose N € N such that 


n>N_ implies |f,(x) — f(x) < : 


for x € E. Since | fn(x) — fn(®)| < |fn@) — FO) +1F@) — fin@)|, it is clear 
that (4) holds for all x € E. 
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Conversely, if (4) holds for x € E, then {f,(x)}nen is Cauchy for each x € E. 
Hence, by Cauchy’s Theorem for sequences (Theorem 2.29), 


f(@):= lim fr(x) 
now 
exists for each x € E. Take the limit of the second inequality in (4) asm — oo. 
We obtain | f(x) — f(x)| < ¢/2 < efor alln > N andx e€ E. Hence, by 


definition, f, — f uniformly on E. a 


Here is a result about interchanging a limit sign and the derivative sign (com- 
pare with Remark 7.5). The proof presented here comes from Apostol [1]. 


7.12 Theorem. Let (a,b) be a bounded interval and suppose that fy is a 
sequence of functions which converges at some xo € (a,b). If each fn is dif- 


ferentiable on (a,b), and f; converges uniformly on (a,b) as n — oe, then fy, 
converges uniformly on (a, b) and 


/ 
lim, fe) = (lim f(x) 
for each x € (a, b). 
Proof. Fix c € (a, b) and define 


fn(x) = fn©) 


8n(X) = tae vee 
fi) x=C 
forn EN. Clearly, 
fn(x) = fn(c) + (& = €)gn(x) (5) 


forn € Nandx € (a,b). 

We claim that for any c € (a,b), the sequence g, converges uniformly on 
(a,b). Lete > 0, n,m €N, and x € (a,b) with x # c. By the Mean Value 
Theorem, there is a € between x and c such that 


Su(®) — fn(®) — (fnlo) — fn(e)) 


X—C 


8n(X) — Bm(X) = =f) = fe). 


Since f/ converges uniformly on (a,b), it follows that there is an N € N 
such that 


n,m>N_ implies [gn(x) — gm(x)| < € 


for x € (a,b) with x # c. This implication also holds for x = c because 
gn(c) = f,(c) for alln € N. This proves the claim. 

To show that f, converges uniformly on (a,b), notice that by the claim, 
gn converges uniformly as n — oo and (5) holds for c = xg. Since f, (xo) 
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converges as n —> oo by hypothesis, it follows from (5) and b — a < ov that f, 
converges uniformly on (a, b) asn — oo. 


Fix c € (a,b). Define f, g on (a, b) by f(x) := limy+soo fx(x) and g(x) := 
limy-so0 n(x). We need to show that 


fo) = lim, fy(0). (6) 


Since each g, is continuous at c, the claim implies g is continuous at c. Since 
gn(c) = f,(c), it follows that the right side of (6) can be written as 


lim fi(c) = lim gn(c) = g(c) = lim g(x). 
nw nw x>C 
On the other hand, if x 4 c we have by definition that 


FR)= FO _ ful) = ful) _ 
fO-fO _ 4 hO-hoO 


x-—C¢ noo x-Cc 7 


lim 8n(x) = g(x). 


Therefore, the left side of (6) also reduces to 


f= tim LO=LO — tim go. 
x>C x—C x>C 
This verifies (6), and the proof of the theorem is complete. | 
EXERCISES 
7.1.1. a) Prove that x/n — O uniformly, as n — ov, on any closed interval 
[a, b]. 
b) Prove that 1/(nx) — 0 pointwise but not uniformly on (0, 1) as 
n—> oO. 


7.1.2. Prove that the following limits exist and evaluate them. 


4 76 
y nx'° +4 
a) Hime | Pi qase 


b) limp +0 i ex'/" dx 


6) my =s65 Hie /cos* + x +2dx 


7.1.3. A sequence of functions f,, is said to be uniformly bounded on a set E if 
and only if there is an M > 0 such that | f,(x)| < M for all x € E and all 
neN. 

Suppose that for eachn EN, f, : E > Ris bounded. If f, > f 
uniformly on E, asn — N, prove that {f,} is uniformly bounded on E 
and f is a bounded function on E. 

7.1.4. Let [a, b] be a closed bounded interval, f : [a,b] — R be bounded, 

and g : [a,b] — R be continuous with g(a) = g(b) = 0. Let f, be a 
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7.1.5. 


7.1.6. 


7.1.7. 


7.1.8. 


7.1.9. 


7.1.10. 


7.111. 


Infinite Series of Functions 


uniformly bounded sequence of functions on [a, b] (see Exercise 7.1.3). 
Prove that if f, — jf uniformly on all closed intervals [c,d] C (a,b), 
then f,g > fg uniformly on [a, db]. 

Suppose that f, — f and g, — g,asn — oo, uniformly on some set 
ECR. 


a) Prove that fr + gn > f +g andaf, > af,asn — ov, uniformly on 
E foralla ER. 

b) Prove that fig, > fg pointwise on E. 

c) Prove that if f and g are bounded on £, then f,,g, > fg uniformly 
on E. 

d) Show that c) may be false when g is unbounded. 


Suppose that E is a nonempty subset of R and that f, — f uniformly 
on E. Prove that if each f, is uniformly continuous on E, then f is 
uniformly continuous on E. 

Suppose that f is uniformly continuous on R. If y, — 0 asn — oo and 
Sn(x) = f(x + yn) for x € R, prove that f, converges uniformly on R. 
Suppose that b > a > 0. Prove that 


b oe 
tim, f (1+=) e* dx =b—a. 
noo Jy n 


Let f, g be continuous on a closed bounded interval [a, b] with |g(x)| > 0 
for x € [a, b]. Suppose that f, > f and g, — g asn — o, uniformly 
on [a, b]. 


a) Prove that 1/g, is defined for large n and f,,/g, > f/g uniformly on 
[a, basn > oc. 
b) Show that a) is false if [a, b] is replaced by (a, b). 


Let E be anonempty subset of R and f be a real-valued function defined 
on E. Suppose that f, is a sequence of bounded functions on E which 
converges to f uniformly on E. Prove that 


AiG) + + fx@) 


n 


> f(x) 


uniformly on E as n — oo (compare with Exercise 6.1.9). 
Let f, be integrable on [0,1] and f,, > f uniformly on [0, 1]. Show that 
if b, ¢ 1 asn > ov, then 


by 1 
lim Sa(x) dx = / f(x) dx. 


7.2. UNIFORM CONVERGENCE OF SERIES 


In this section we extend the concepts introduced in Section 7.1 from sequences 
to series. 
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7.13 Definition. 


Let f; be a sequence of real functions defined on some set EF and set 


Sn (x) =) h@, xEE,neN. 


k=1 


i) The series }°°°, fx is said to converge pointwise on E if and only if the 
sequence s,(x) converges pointwise on E asn > ov. 
ii) The series )°7°, fx is said to converge uniformly on E if and only if the 
sequence s,(x) converges uniformly on E asn > ow. 
iii) The series )°7~., fx is said to converge absolutely (pointwise) on E if and 
only if }°7°., | fe(x)| converges for each x € E. 


Since convergence of series is defined in terms of convergence of sequences of 
partial sums, every result about convergence of sequences of functions contains 
a result about convergence of series of functions. For example, the following 
result is an immediate consequence of Theorems 7.9, 7.10, and 7.12. 


7.14 Theorem. Let E be a nonempty subset of R and let { f,} be a sequence of 
real functions defined on E. 


i) Suppose that xo € E and that each fy, is continuous at xo € E. If f = \-p-., fk 
converges uniformly on E, then f is continuous at xo € E. 

ii) [TERM-BY-TERM INTEGRATION]. Suppose that E = [a,b] and that each fy is 
integrable on [a,b]. If f = Yop24 fe converges uniformly on [a, b], then f is 
integrable on [a, b] and 


b © CO Ab 
[ Mhwa=> | fix) dx. 
4 k=l k=1°? 


iii) [TERM-BY-TERM DIFFERENTIATION]. Suppose that E is a bounded, open interval 
and that each f, is differentiable on E. If \-7°., fe converges at some xo € 
E, and \°°-., fy converges uniformly on E, then f := Yi, fe converges 
uniformly on E, f is differentiable on E, and 


(3109) =A 
k=1 k=1 
forx €E. 


Here are two much-used tests for uniform convergence of series. (The second 
test, and its example, is optional because we do not use it elsewhere in this text.) 
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7.15 Theorem. [WEIERSTRASS M-TEST]. 

Let E be a nonempty subset of R, let fy : E — R, k € N, and suppose that 
M; = 0 satisfies er My < o. If |fe(x)| < My fork € Nand x € E, then 
ye Se converges absolutely and uniformly on E. 


Proof. Let « > 0 and use the Cauchy Criterion to choose N € N such that 
m >n> N implies )°7_,, My < ¢. Thus, by hypothesis, 


Yo fe) 
k=n 


<lA@l< SoM <e 
k=n k=n 


form >n > WN andx € E. Hence, the partial sums of 77°, fy are uniformly 
Cauchy and the partial sums of PE od 1 |fe(x)| are Cauchy foreachxe EF. 


*7.16 Theorem. [DIRICHLET’S TEST FOR UNIFORM CONVERGENCE]. 
Let E be a nonempty subset of R and suppose that fx, g,: E > R, k EN. If 


Yo fix) 


k=1 


<M<ow 


forn € Nand x € E, and if gx | 0 uniformly on E as k > o, then °°, fea 
converges uniformly on E. 


Proof. Let 
n 
Frim(x) = ys Sk), mneN,n>m,xeE 
k=m 


and fix integers n > m > 0. By Abel’s Formula and hypothesis, 


n n—-1 
S> fx) ge(*)| = | Fam 2) 8n(x) +S Fim) (ger) = ge+1)) 
k=m k=m 


n—-1 


< 2Mgn(x) +2M D0 (ge(x) — geti(x)) 


k=m 


= 2Mgn(x) 


for all x € E. Since g,,(x) > 0 uniformly on E, as m — ov, it follows from the 
uniform Cauchy Criterion that }°7°., fe(x)gx(x) converges uniformly on E. 


Here is a typical application of Dirichlet’s Test. 
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*7.17 EXAMPLE. 


Prove that if a, | 0 as k — ov, then }°7° az coskx converges uniformly on any 
closed subinterval [a, b] of (0, 277). 


Proof. Let f(x) = coskx and gx(x) = ax fork € N. By the technique used in 
Example 6.34, we can show that 


n sin (5) + sin (x + ;)*) 
Dy(x) = nee zsin (2) 


forn € Nand x € (0, 277). Hence the partial sums of Yo k(x) satisfy 


sin (5) + sin (x + )*) 1 
|Dn(x)| = = — 
2sin (>) sin (5)| 


for x € (0, 27). If 6 = min{27 — b, a} and x € [a, b], then sin(x/2) > sin(é/2) 
(see Figure 7.4). Therefore, )°?° ; a, coskx converges uniformly on [a, b] by 
Dirichlet’s Test. a 


This example can be used to show that uniform convergence of a 
series alone is not sufficient for term-by-term differentiation. Indeed, although 
> coskx/k converges uniformly on [z/2, 37/2], its term-by-term derivative 
>. (— sinkx) converges at only one point in [7/2, 37/2]. 


#Y 


FIGURE 7.4 
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A double series is a series of numbers or functions of the form 
CO CO 
ye oa 
k=1 \ j=l 


Such a double series is said to converge if and only if pare akj converges for 
each k € N and 


exists and is finite. 

When working with double series, one frequently wants to be able to change 
the order of summation. We already know that the order of summation can be 
changed when ax; > 0 (see Exercise 6.3.6). We now prove a more general result. 


(The elegant proof given here, which comes from Rudin [11],! uses uniform 
convergence.) 
7.18 Theorem. Let ax; €¢ R fork, j ¢ N and suppose that 
Co 
Aj = > laxj| < 00 


k=1 


for each j EN. If pare A; converges (i.e., the double sum converges absolutely), 
then 


CO CO CO CO 
Dm = DD ai 
k=1 j=l j=lk=l 


Proof. Let E = {0, 1, 4 3 is ...}. For each j € N, define a function f; on E by 


Co 1 n 
fOr age. F (~) =)iaj, nen. 
k=l k=l 


By hypothesis, f;(0) exists and by the definition of series convergence, 


1 
Jim, 5 ( ) = fj (0); 


lWalter Rudin, Principles of Mathematical Analysis, 3rd ed. (New York: McGraw-Hill Book 
Co., 1976). 
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that is, fj; is continuous at 0 € E for each j € N. Moreover, since | fj(x)| < Aj 
for all x € E and j € N, the Weierstrass M-Test implies that 


FOS ye) 


j=l 


converges uniformly on E. Thus f is continuous at 0 € E by Theorem 7.9. 
It follows from the sequential characterization of continuity (Theorem 3.21) 
that f(1/n) > f(O) asn — oo. Therefore, 


Ce Co n CO [oe n 
Does =e aa = De ae 
k=1 j=1 k=1 j=1 j=lk=1 
=m Di (F)= iar G)=20=o 00 
j=l j=lk=1 
EXERCISES 


7.2.1. a) Prove that }°7°., cos(x/k*) — 1 converges uniformly on any bounded 
interval in R. 


b) Prove that }°725 ek x=1 converges uniformly on any closed subin- 
terval of (0, co). 


7.2.2. Prove that the geometric series 


1-x 


1 
pe 
? 
k=0 


converges uniformly on any closed interval [a, b] c (1, 1). 
723. Ler BG) = 7= gt [ ki 
a) Prove that the series defining E(x) converges uniformly on any closed 
interval [a, b]. 
b) Prove that 


b 
} E(x) dx = E(b) — E(a) 


foralla,beR. 
c) Prove that the function y = E(x) satisfies the initial value problem 


y-y=0, yO=1. 


[We shall see in Section 7.4 that E(x) = e*.] 
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7.2.4. 


7.2.5. 


7.2.6. 


7.2.7. 


7.2.8. 


*7.2.9. 
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Suppose that 
= sin(kx) 
f@=) a 
k=1 
Prove that 
x /2 09 (-1)**! 4+] 
[ (@d-) a 
k=1 
Show that 
== cl x 
= 1 
f(x) > l(a) 


converges, pointwise on R and uniformly on each bounded interval in R, 
to a differentiable function f which satisfies 


[f(@)| < |x| and |f’(x)| <1 


for allx € R. 
Prove that 
Co 
ae (1 — cos(1/k))| < 2. 
k=l 
Suppose that f = }°7°, fe converges uniformly on a set E CR. If g; is 


bounded on E and gx(x) > geii(x) > Oforallx ¢ E andk EN, prove 
that )°7°, fe gx converges uniformly on E. 

Let n > 0 be a fixed nonnegative integer and recall that 0! := 1. The 
Bessel function of order n is the function defined by 


Co 


(-1)* x \ n+2k 
eae) =) aa +b! (3) 


k=0 


a) Show that B,(x) converges pointwise on R and uniformly on any 
closed interval [a, b]. 
b) Prove that y = B,(x) satisfies the differential equation 


" 


xy +xy' + (x? — n)y -—0 


forx ER. 
c) Prove that 
(x" Bn(x))! = x” Bn-1(x) 


forn €Nandx eR. 
Suppose that a, | 0 ask — oo. Prove that )°7°, a, sinkx converges 
uniformly on any closed interval [a, b] C (0, 27). 
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7.2.10. Suppose that f|, fo, ... are continuous real functions defined on a closed, 


bounded interval [a,b]. If0 < fev) < fe4i(x) for allk € Nandx € 
[a, b], and if f, — f uniformly on [a, b], prove that 


b [2 ifn b 
jim (>: fees) dx = / f (x) dx. 
a k=1 a 


7.3 POWER SERIES 


Polynomials are functions of the form P(x) = ue, arx*, where a, € R and 
n > 0. In this section we investigate a natural generalization of polynomials, 
namely, series of the form )\?°9 axx*. 

Actually, we shall consider a slightly more general class of series. A power 
series (centered at xo) is a series of the form 


S(x) = Yo ag(x — x0)", 


k=0 


where we use the convention that (x —x9)° = 1. In fact, although 0° is in general 
indeterminate, when dealing with power series we always interpret 0° = 1. 

Since S(x) is identically ag when x = xp, it is clear that every power series 
converges at at least one point. The following result shows that this may be the 
only point. 


7.19 Remark. There exist power series which converge only at one point. 


Proof. For eachx 40, (ke x|/)1/* = k|x| > 00 as k > oo. Therefore, by the 
Root Test, the series )°7-,, k*x* diverges when x # 0. a 


In general, a series of functions can converge at several isolated points. [For 
example, the series )-?~., sin(kx) converges only when x = nz for some n € Z.] 
We shall see (Theorem 7.21 below) that this cannot happen for power series. 
Hence, we introduce the following concept. 


7.20 Definition. 


An extended real number R is said to be the radius of convergence of a power 
series S(x) := ae, a(x — xo)* if and only if S(x) converges absolutely for 
|x — xo| < R and S(x) diverges for |x — xo| > R. 


The extreme cases are R = 0 and R = oo. When R = 0, the power series S(x) 
converges only when x = x9. When R = o, the power series S(x) converges 
absolutely for every x € R. 

The next result shows that every power series S has a radius of convergence 
which can be computed using roots of the coefficients of S. 
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7.21 Theorem. Let S(x) = a a(x — xo) bea power Series centered at xo. If 
R = 1/lim sup; _, .,|ax|!/*, with the convention that 1/00 = 0 and 1/0 = 0x, then 
R is the radius of convergence of S. In fact, 


i) S(x) converges absolutely for each x € (xo — R, x0 + R), 
ii) S(x) converges uniformly on any closed interval [a, b] C (xo — R, x9 + R), 
iii) and (when R is finite), S(x) diverges for each x ¢ [xo — R, xo + R]. 


Proof. Fix x € R,x 4 xo, and set p := 1/lim SUP; , olakl!/*, with the conven- 
tion that 1/oo = 0 and 1/0 = ow. To apply the Root Test to S(x), consider 


I/k 1k 


r(x) :=lim sup lag (x — x0)*|!/* = |x — xo| - lim sup |ag| 
k>oo k>oo 


Case I. p = 0. By our convention, p = 0 implies r(x) = oo > 1, so by 
the Root Test, S(x) does not converge for any x 4 x9. Hence, the radius of 
convergence of Sis R=0= op. 

Case 2. p = oo. Then r(x) = 0 < 1, so by the Root Test, S(x) converges 
absolutely for all x € R. Hence, the radius of convergence of S$ is R = co = p. 

Case 3. p € (0,00). Then r(x) = |x — xo|/p. Since r(x) < 1 if and only 
if |x — x9| < p, it follows from the Root Test that S(x) converges absolutely 
when x € (xo — p, X9 +p). Similarly, since r(x) > 1 if and only if |x — x9| > p, 
we also have that S(x) diverges when x ¢ [xo — p, x9 + e]. This proves that p 
is the radius of convergence of S, and that parts i) and iii) hold. 

To prove part ii), let [a, b] C (xo—R, xo + R). Choose an x; € (xo—R, x9 +R) 
such that x ¢€ [a,b] implies |x — xo| < |x1 — xo| (see Figure 7.5). Set 
My = |ax||x1 — xo|* and observe by part i) that yo Me converges. Since 
lax(x — xo)*| < My for x € [a,b] and k € N, it follows from the Weierstrass 


M-Test that S(x) converges uniformly on [a, b]. a 
€ o—_f —= } . 
xy-R xy a xX Ob Xo +R 
FIGURE 7.5 


The following result, which is weaker than Theorem 7.21 (see Exercise 6.3.8), 
provides another way to compute the radius of convergence of some power 
series (see also Exercise 7.3.8). This way is easier when ax contains products 
(e.g., factorials). 


7.22 Theorem. [f the limit 


; ak 
R= lim lax 
k->oo |ag+1| 
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exists as an extended real number, then R is the radius of convergence of the power 
series S(x) = \ P29 ax (x — xo). 


Proof. Repeat the proof of Theorem 7.21, using the Ratio Test instead of the 
Root Test, to find that S(x) converges absolutely on (xo — R,xo + R) and 
diverges for each x ¢ [xo — R, x9 + R]. By Definition 7.20, R must be the 
radius of convergence of S(x). Hi 


7.23 Definition. 


The interval of convergence of a power series S(x) is the largest interval on 
which S(x) converges. 


By Theorem 7.21, for a given power series S = yo a(x — xo)‘, there are 
only three possibilities: 


i) R =o, in which case the interval of convergence of S is (—0o, 00), 
ii) R = 0, in which case the interval of convergence of S is {xo}, and 
iii) 0 < R < oo, in which case the interval of convergence of S is 
(xo — R, x9 + R), [xo — R, x0 + KR), (Xo — R, X0 + RI], or [xo — R, xo + RI. 


To find the interval of convergence of a power series, therefore, one needs to 
compute the radius of convergence R first. If0 < R < ow, one must also check 
both endpoints, x9 — R and xo + R, to see whether the interval of convergence 
is closed, open, or half open/closed. Notice once and for all that the Ratio and 
Root Tests cannot be used to test the endpoints, since it was the Ratio and Root 
Tests which gave us R to begin with. 


7.24 EXAMPLE. 
Find the interval of convergence of S(x) = be ae 1 KP nl K 


Solution. By Theorem 7.22, 


k+1 k+1 
R= lim VEE = | im Ow O24. 
k->oo Vk k>o Ok 
Thus, the interval of convergence has endpoints 1 and —1. S(x) diverges at 
x = | by the p-Series Test and converges at x = —1 by the Alternating Series 
Test. Thus, the interval of convergence of S(x) is [—1, 1). a 


7.25 Remark. The interval of convergence may contain none, one, or both its 
endpoints. 
Proof. By Theorem 7.22, the radius of convergence of each of the series 
OO Lk 


Ce i lo.e) xk x 
ds ; ee oe 
=1 


k=1 =1 
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is 1, but by the Divergence Test, the Alternating Series Test, and the p-Series 
Test, the intervals of convergence of these series are (—1, 1), [—1, 1), and 
[—1, 1], respectively. 


We now pass from convergence properties of power series to the calculus of 
power series. The next several results answer the question, What properties 
(e.g., continuity, differentiability, integrability) does the limit of a power series 
satisfy? 


7.26 Theorem. If f(x) = bean, ay(x —x0)* is a power series with positive radius 
of convergence R, then f is continuous on (xp — R, xo + R). 


Proof. Let x € (xo — R, xo + R) and choose a, b € R such that x € (a, b) and 
[a,b] C (x9 — R, x9 + R). By Theorems 7.2111 and 7.141, f is continuous on 
(a, b) and hence at x. | 


The following result shows that continuity of the limit extends to the end- 
points when they belong to the interval of convergence. 


7.27 Theorem. [ABEL’S THEOREM]. 
Suppose that [a, b] is nondegenerate. If f(x) := yea ay(x — xo)* converges 
on [a, b], then f (x) is continuous and converges uniformly on [a, b]. 


Proof. By Theorems 7.21ii and 7.26, we may suppose that f has a positive, 
finite radius of convergence R, and, by symmetry, that a = x9 andb = x04 R. 
Thus, suppose that f(x) converges at x = x9 + R and fix x; € (x9, x9 + RI. 
Set by = a,R* and cy = (x1 — xo)‘/R* for k € N. By hypothesis, )°?° , bx 
converges. Hence, given ¢ > 0, there is an integer N > | such that 


k 
k>m2>WN_ imply S°b; <&, 
j=m 


Since 0 < x; — x9 < R, the sequence {c;} is decreasing. Applying Abel’s 
Formula and telescoping, we have 


n n 
SS ax(x = wo! = So beck 
k=m k=m 
n n—-l k 
= Jen \ ibe + Do (ce — ces) Y) Bj 
k=m k=m j=m 


< CnE + (Cm — Cn)E = CmE. 
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Since cm <c, < R/R = 1, it follows that 


Yo ax (x1 — x0) 


k=m 


<E 


for all x; € (xo, x9 + RJ. Since this inequality also holds for x; = xo, we 
conclude that 6 a(x — xo)* converges uniformly on [xo, xo + R]. | 


7.28 Remark. If a power series S(x) = a ar ar(x — xo)* converges at some 
x1 > xo, then S(x) converges uniformly on [xo, x1] and absolutely on [xo, x1). 
It might not converge absolutely at x = x. 


Proof. By Theorems 7.21 and 7.27, S(x) converges uniformly on [xo, x;] and 
absolutely on [xg, x,). The power series bares (—x)* /k converges uniformly 
on [0, 1] but not absolutely at x = 1. a 


To discuss differentiability of the limit of a power series, we first show that 
the radius of convergence of a power series is not changed by term-by-term 
differentiation (compare with Exercise 2.5.6). 


7.29 Lemma. 
Tf a, € Rforn EN, then 


lim sup(n|aq|)!/" = lim sup la, |!/”. 
n—->Co n—>Co 


Proof. Let « > 0. Since n'/" _5 1 asn — oo, choose N € Nso thatn > N 
implies 1 — ¢ <n!/" < 1+ ¢; that is, 


(1 —s)lan|!/" < (alan)? < 1 +.)lan|". 


It follows that ifn > N, then supy sn (klax|)!/* < (1+ 6) sup,., lax |1/*. Taking 
the limit of this last inequality, as n — oo, we have by definition that 


x := lim sup(n|an|)'/" < (1 + €) limsup |an|!/" =: (1 + e)y. 
n—->Co n—> Co 


Taking the limit of this inequality as ¢« — 0, we obtain x < y. A similar 
argument, using (1 — «) in place of (1 + €), proves that x > y. We conclude 
that x = y as promised. | 


We use this result to prove that each power series with a positive radius of 
convergence is term-by-term differentiable. 


7.30 Theorem. /f f(x) = S729 ax(x —x0)* is a power series with positive radius 
of convergence R, then f'(x) = \-p-., kag (x — xo)! for x € (xo — R, xo + R). 
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Proof. Let I := (xp — R, xo + R) and suppose that a, b € R satisfy [a, b] C I. 
By Lemma 7.29 and hypothesis, the radius of convergence of the series 
g(x) = yar kax(x — x9)* is R. Thus g converges absolutely on J and uni- 
formly on [a, b]. 

Consider the derived series S*(x) := S772 kag (x — xo)*—!. Since $*(xo) has 
only one nonzero term, the series $*(xo) converges absolutely. If x € I\{xo}, 
then S*(x) = g(x)/(x — xo), So again, S*(x) converges absolutely. It follows 
that the radius of convergence of S* is at least R. Hence, by Theorems 7.21 
and 7.14iii (term-by-term differentiation), f is differentiable on [a,b] and 
S*(x) = f(x) for all x € [a, b]. Since any x € J belongs to some [a, b] Cc TJ, 
we conclude that f’(x) = S*(x) for all x € I. B 


NOTE: A similar proof shows that S* diverges for all x ¢ [xo — R, x9 + R], so 
the radius of convergence of S* is exactly R. 


Recall that for each nonempty, open interval (a, b), C°(a, b) represents the set 
of functions f such that f“ exists and is continuous on (a, b) for all k € N. The 
following result generalizes Theorem 7.30. 


7.31 Corollary. If f(x) = > 7729 ak (x — xo)* has a positive radius of convergence 
R, then f € C™(xo — R, xo + R) and 


oC 


i} 
f©) = Ys —— ana — x9)" (7) 
2 (n —k)! 


forx € (40 — R, x9 + R) andk EN. 


Proof. The proof is by induction on k. By Theorem 7.30 and the fact that 
O! := 1, (7) holds for k = 1 and x € (xo — R, x9 + R). If (7) holds for some 
k € Nandall x € (xo — R, xo + R), then f™ isa power series with radius of 
convergence R. It follows from Theorem 7.30 that 


love) / 
+) -) — (¢Mpvy — eee, Ree 
fg) = (> Go pine ~ 70) 
= Gy (x — x9)" *7! 
parts (n—k-—1)! 
for all x € (xo — R, x9 + R). Hence, (7) holds for k + 1 in place of k. | 


The following result shows that each power series with a positive radius of 
convergence can also be integrated term by term. 
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7.32 Theorem. Let f(x) = a a(x — x0)* bea power series and leta,beR 
with a < b. 


i) If f(x) converges on [a, b], then f is integrable on [a, b] and 


b ee) b 
/ f(x) ax= >a f (x — xo)* dx. 
a k=0 a 


*ii) If f (x) converges on [a, b) and if \yy>.9 ax(b — xo)k+!/(k +1) converges, then 
fis improperly integrable on (a, b) and 


b i) b 
i f(x) ax= >a f (x — xo)* dx. 
a k=0 a 


Proof. i) By Abel’s Theorem, f(x) converges uniformly on [a, b]. Hence, by 
Theorem 7.14ii, f(x) is term-by-term integrable on [a, b]. 
ii) Leta <t <bandset A= Yeo aga — xo)t"/(k +1). By part i), 


t as t or) 
/ f(x) dx = Ya f (x — xo) dx = aes axgyer _A. 


=0 


The leftmost term of this last difference is a power series which by hypothesis 
converges at t = b. Thus, by the definition of improper integration and Abel’s 
Theorem, 


b t 
[ seoar= jim [ f(x) dx 


: ak 
= lim —— 
t>b— k+1 


oo ak oo b 
= b—x)t!_A= / =p)" dx, a 
aaa X0) do : (x — xo)" dx 


(t x xo)**! —A 


The following result shows that the product of two power series is a power 
series. (For a result on the division of power series, see Taylor [13], p. 619.) 


7.33 Theorem. If f(x) = Pog anx* and g(x) = Yeo bex* converge on 
(—r,r) and 


k 
=)" abe, k=0,1,..., 
j=0 


then Yo cex* converges on (—r,r) and converges to f (x)g(x). 
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Proof. Fix x € (—r,r) and for eachn € N, set 
n n n 
In(x) = ya’, 8n(X) = byx*, and hp (x) = ye cp. 
k=0 k=0 k=0 


By changing the order of summation, we see that 
nek n n 
hy(x) = ye Saja = > ajx Ds ae 
k=0 j=0 j=0 k=j 


= Ye ajx! en—7@) = g(x) fn(x) + S| ajx! (Bn—j) — g(x)). 


j=0 j=0 


Thus, it suffices to show that 


slim, Y7 aj! (@n—j(2) — (2) = 0. 
j=0 


Let ¢ > 0. Since f(x) converges absolutely and g,(x) converges as n > ov, 
choose M > 0 such that 729 |axx*| < M and 


Ign—j(®) — g@@)| <M 


for all integers n > j > 0. Similarly, choose N € N such that 


CO 
é € 
; : a sed aa 
£>N implies |ge(x) — g(x)| < aM and a lajx’| < aM" 
j= 


Let n > 2N. Then 


YS ajx! (Bn—j(x) — g(x)) 


j=0 
N n 
=|) ajx!(gn_j(%) — 8) + > ajx!(en_j(%) — 8@)) 
j=0 j=N41 
é = ‘ : é é 
< 599 2 lajx/1 + M > lajx/|<5+5=8. o 


j=0 j=N+1 
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7.34 Corollary. Suppose that az,by € R and that cy := ah for 
k=0,1, .... [f either 


i) p29 ax and °°°.9 be both converge, and at least one of them converges abso- 
lutely, 


ii) on, if Pep ak, Yopeg de, and Y~P-.9 cx all converge, 


then 
Ce Co CO 
ye (>: a) 3 7) | (8) 
k=0 k=0 
Proof. i) Repeat the proof of Theorem 7.33 with x = 1. 


ii) By hypothesis, the radii of convergence of )7?2y agx*, S7P9 byx*, and 
yo cex* are all at least 1; hence, by Theorem 7.33, 


e cyx* = (> as!) (>: ne) (9) 
k=0 k=0 k=0 


for x € (—1, 1). But by Abel’s Theorem (Theorem 7.27), the limit of (9) as 
x t Lis (8). | 


The hypotheses of Corollary 7.34 cannot be relaxed. 


*7.35 EXAMPLE. 
If ay = by = (—1)*/ Vk for k € N and ap = bo = 0, then "P29 cx diverges. 


Proof. If >~ 7.9 ck converges, then cy > 0 as k > oo. But fork > 1 odd, 


(k=1)/2 i 


oe =2 ——_——- 
k . : . : 
F 1 ViVE =F k—j Jel VINK = j k-j 


- 2o(“") (ae =a) = 5) =e 


Thus cg, cannot converge to zero, a contradiction. | 


We close this section with some optional material on finding exact values of 
convergent power series. Namely, we show how term-by-term differentiation 
and integration can be used in conjunction with the geometric series to obtain 
simple formulas for certain kinds of power series. Such formulas are called 
closed forms. 
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*7,.36 EXAMPLE. 


Find a closed form of the power series 


f@a= Sat 
k=1 


Solution. Since the interval of convergence of this power series is (—1, 1), we 
have by Theorems 7.32 and 6.7 (the Geometric Series) that 


f(t) fe ik x 
aaa pe di= > x ae 


for each x € (—1,1). [Note that f(x)/x is defined at x = 0 and has value 1.] 
Hence, by the Fundamental Theorem of Calculus, 


£@)-- x = 1 
ee (=) ~ (—x)? 


and it follows that 


XxX 
ee —1,1). a 
fO=qrpr FECL 
“7.37 EXAMPLE. 
Find a closed form of the power series 
oO Lk 
x 
se rat 


Solution. Since the interval of convergence of this power series is [—1, 1), we 
have by Theorem 7.30 that 


k+l 


eso = (25) = ys = 


k=0 


for x € (—1, 1). Hence, by the Fundamental Theorem of Calculus, 


wo = f = —log(1 — x) 
Gs dot 


for x € (—1, 1). Since g(—1) exists and log(1 — x) is continuous at x = —1, we 
conclude by Abel’s Theorem that 


g(x) = —8E x €[—1,1)\ {0}, and g(0)=1. | 
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EXERCISES 


7.3.1. Find the radius of convergence of each of the following power series. 


7.3.2. 


7.3.3. 


7.3.4, 


7.3.5. 


7.3.6. 


ee k 
b) © (14+) x* 
k=0 
c) 3 52k? 2k? 
k=0 
CO 
d) yo pk+1,? 
k=0 


Find the interval of convergence of each of the following power series. 


b) 3° (5—(-b*)* @ — Df 
k=0 
oo k+1 

c) ¥ log (+) (x + 1)* 
k=1 


i 


2k 
Fo A, ORD) 


Suppose that )°7° 9 ayx* has radius of convergence R € (0, 00). 


a) Find the radius of convergence of )*?o9 azx7*. 


b) Find the radius of convergence of )7?2.9 azx*. 


Suppose that |ax| < |b,| for large k. Prove that if )7°°, byx* converges 
on an open interval /, then )°7°.5 ayx* also converges on J. Is this result 
true if open is omitted? 

Suppose that {ax}?°.) is a bounded sequence of real numbers. Prove that 


f(x) := »— ayx* 
k=0 


has a positive radius of convergence. 
A series )°7°.9 ag is said to be Abel summable to L if and only if 


CO 
lim Ss ayr* —i bo 
k=0 


r>1- 


a) Prove that if }°7°)a, converges to L, then )°?°,ax is Abel 
summable to L. 
b) Find the Abel sum of )772.9(—L¥. 
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*7,3.7. Find a closed form for each of the following series and the largest set 
on which this formula is valid. 


a) 3 5k+1 
k=1 


Co 
b) > 3kx*-? 
k=2 


co 65k 
ea le 


ae 
oo dk 
d er 
2 k+1 
*7,3.8. If )°?°, axx* has radius of convergence R and a, # 0 for large k, 
prove that 
lim inf = < R < limsup me ; 
k->oo |k+1 k>o0 |4k+1 


*7,3.9. Prove that 
oo z k 
as (; = =) 
k=0 
is differentiable on (—2, 2) and 
2 
ti 
as 
Ole o—p 


for0O <x <2. 
7.3.10. Suppose that a, | 0 ask — oo. Prove that given e > 0 there isa dé > 0 
such that 


> (-Dhar(x* — y)] <e 


k=0 


for all x, y € [0, 1] which satisfy |x — y| < 6. 
*7.3.11. a) Prove the following weak form of Stirling’s Formula (compare with 
Theorem 12.73): 


n n+1 


! 
Ps er 


e”— 


b) Find all x € R for which the power series 


converges absolutely. 
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7.4 ANALYTIC FUNCTIONS 


In this section we study functions which can be represented by power series. 
(For a discussion of how to represent functions by trigonometric series instead 
of power series, see Chapter 14.) We begin with the following definition. 


7.38 Definition. 


A real-valued function f is said to be (real) analytic on a nonempty, open 
interval (a, b) if and only if given xg € (a, b) there is a power series centered 
at x9 which converges to f near xo; that is, if and only if there exist coefficients 
{ax}72.9 and points c,d € (a, b) such that c < x9 < d and 


f(x) = Yo ag(x — x0)" 
k=0 


for all x € (c,d). 


We shall develop several techniques for showing that a given function is 
analytic. To simplify statements of results, we continue to use the conventions 
f© := f andO! :=1. 

First, it is important to realize that if f is analytic on an open interval J, then 
for each center xo there is one and only one power series that represents f near 
xo, and that power series has the same coefficients that the Taylor polynomi- 
als have. 


7.39 Theorem. [UNIQUENESS]. 

Let c, d be extended real numbers with c < d, let xp € (c,d), and suppose that 
f:(.4a) >R If f(x) = Vey an(x — x0)* for x € (c,d), then f € C®(c, d) 
and 


_ FE) 


ae = 
k ki , 


k=0,1,-:-. 


Proof. Clearly, f(xo) = ao. Fix k € N. By hypothesis, the radius of con- 
vergence R of the power series )op° 9 ax(x — xo)* is positive and (c,d) C 
(xo — R, xo + R). Hence, by Corollary 7.31, f € C™®(c, d) and 


[e,2) 


! 
f§@M=y ani Gx) * (10) 


n=k 
for x € (c,d). Apply this to x = xg. The terms on the right side of (10) 
are zero when n > k and k!a; when n = k. Hence, f (xo) = kila, for 
eachk EN. | 


This “locally unique” power series has a name. 
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7.40 Definition. 


Let f € C™(a, b) and let x9 € (a, b). The Taylor expansion (or Taylor series) 
of f centered at xo is the series 


— f (xo) 
2 kl (x — xo). 


(No convergence is implied or assumed.) The Taylor expansion of f centered 
at x9 = 0 is usually called the Maclaurin expansion (or Maclaurin series) of f. 


Theorem 7.39 not only says that the power series representation of an ana- 
lytic function is locally unique. It also says that every analytic function is a C® 
function. The next remark shows that the converse of this statement is false. 


7.41 Remark. [CAUCHY]. The function 


etx? x #0 
PO)=)q y=70 


belongs to C®(—o0, 00) but is not analytic on any interval which contains x = 0. 


Proof. It is easy to see (Exercise 4.4.7) that f € C®(—oo, 00) and f (0) = 0 
for all k ¢ N. Thus the Taylor expansion of f about the point x) = 0 is 
identically zero but f(x) = 0 only when x = 0. a 


One of our aims in this section is to prove that many of the classical C™ 
functions used in elementary calculus are analytic on their domain. Since, by 
Theorem 7.39, a C™ function f is analytic on an open interval / if and only if its 
Taylor expansion at each xp € J converges to f near xo, the following concept is 
useful in this regard. 


7.42 Definition. 


Let f € C®(a, b) and xo € (a, b). The remainder term of order n of the Taylor 
expansion of f centered at xo is the function 


' BN A) 
Ri(x) = REO) =F) - NL Oy 
k=0 


In fact, by Theorem 7.39 and Definition 7.42, a function f € C®(a, b) is analytic 
on (a, b) if and only if for each xg € (a, b) there is an interval (c, d) containing 
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x9 such that Ri — 0, asn — ov, for every x € (c,d). We shall use this 
observation frequently below. 

By Taylor’s Formula (Theorem 4.24) the remainder term of an f € C™(a, b) 
satisfies 


. (n) 
RL (x) = LOG — x9)" 


for some c between xo and x (note the index shift from n + 1 ton.) Therefore, it 
should come as no surprise that there are several results that state the following: 


If the nth derivative of f satisfies a certain condition, then f is analytic on (a, b). 
Here is a particularly simple but useful result of this type. 


7.43 Theorem. Let f ¢ C™(a, b). If there isan M > 0 such that 
FP G(s" 


for all x € (a,b) andn €N, then f is analytic on (a,b). In fact, for each xo € 
(a, b), 


LO A(R) 
fos yt “ (x — x9)! 
k=0 


holds for all x € (a, b). 


Proof. Fix xo € (a,b) and set C = max{M|a — xo|, M|b — xo|}. By Taylor’s 
Formula, 


IFO! nw C 
— Xxo| 


[RP (x)| = —— |x Sg Se 
n! n! n! 


for alln € N. But C"/n! > 0 asn — oo for any C ¢€ R (being terms of 
a convergent series by the Ratio Test). Thus, by the Squeeze Theorem, the 


remainder term Re *0 4) converges to zero for every x € (a, b). a 


Here are three examples of Theorem 7.43 in practice. 


7.44 EXAMPLE. 


Prove that sin x and cos x are analytic on R and have Maclaurin expansions 


oo (—1) x24! 0° (= 1)kx 2k 


sinx = eae COS XxX = 5 -——_— Qk)! i (11) 


k=0 


Proof. In Example 4.26 [see (19) there], we proved that the Taylor series 
of f(x) := sinx centered at x9 = O is S(x) := Yyeg(—L x24! /(2k + 1)! 
Since f(x) is +sinx or +cos x, it is clear that | f(x)| < 1 for all x € R. 
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Therefore, it follows from Theorem 7.43 that sinx is analytic on R and that 
the left side of (11) holds everywhere on R. A similar argument proves the 
right side of (11). g 


7.45 EXAMPLE. 


Prove that e* is analytic on R and has Maclaurin expansion 
x 
ee 
e= ) r: (12) 


Proof. In Example 4.25 [see (18) there], we proved that the Taylor series of 
f (x) := e* centered at xo = O is S(x) = Dg x*/ ki. 

Fix C > 0and notice that | f (x)| = Je*| < e© =: M < M" foralln € Nand 
x € [-C, C]. It follows from Theorem 7.43 that e* is analytic on [—C, C] and 
that S(x) converges to e* everywhere on [—C, C]. Since C > 0 was arbitrary, 
we conclude that (12) holds for all x € R. ) 


Sometimes, it is impractical to get the kind of global estimates on the deriva- 
tives of f necessary to apply Theorem 7.43. The following result, which shows 
that the center of a power series can be changed within its interval of conver- 
gence, is sometimes used to circumvent this problem. 


7.46 Theorem. Suppose that I is an open interval centered at c and that 
Co 
fx)= dl ate-of, xed. 
k=0 
If xo € LT andr > O satisfy (xo —r,xo0 +r) C I, then 
lee) k) 
i ( (xo) k 
FO) = — 20) 
k=0 


forall x € (x9 —1r, x0 +r). In particular, if f is aC function whose Taylor series 
expansion converges to f on some open interval J, then f is analytic on J. 


Proof. It suffices to prove the first statement. By making the change of vari- 
ables w = x — c, we may suppose that c = 0 and J = (—R, R); that is, that 
f(x) = Peo axx*, for all x € (—R, R). Suppose that (xo—r, xo +r) C (—R, R) 
and fix x € (x9 —r, x9 +r). By hypothesis and the Binomial Formula, 


f= 0 agx® =F ag (x — x0) + x0) => ax 
k=0 


k=0 k=0 


(‘) Fate (x — x9)/. (13) 


k 
j=0 
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Since \°7?o.9 ax y* converges absolutely at y := |x — xo| + |xo| < R, we have 


le) k 
2 AE: “Ie = x0)! Dm ( bolt ibe =e 
j=0 


k=0 


k 
lag (lx — xo| + |xo|)" < 00. 


pea 


~ 
ll 


0 


Hence, by (13), Theorem 7.18, and Corollary 7.31, 
loo) ee 
fa)= 2 ak > “(; ae (x — x0)! 
iz 
[e,2) Ce 
— Pe ss (‘) aRxh j (x — xo)! 


j=0 \k=j 

co f oy Nie toatg: OR RG . 
=o no — OFF Be Gh. 
j=0 ta | ys J: = J! 


7.47 EXAMPLE. 


Prove that arctan x is analytic on (—1, 1) and has Maclaurin expansion 


oo ky 2k+1 
—1 
arctan x = ) ee x € (1,1). 
= 2k+1 


Proof. For each 0 < x < 1, the geometric series rg (—1)*t* converges 
uniformly on [—x, x] to 1/(1 + t”). Thus, by Theorem 7.32, 


( 1)* 2k+1 


t = 1)*12* d Boe ety 
arctan x [=- fre yt = al 


k=0 


By uniqueness, this is the Maclaurin expansion of arctan x. Since this expan- 
sion converges on (—1, 1), it follows from Theorem 7.46 that arctan x is analytic 
on (—1, 1). 


In Examples 7.44 and 7.45, we found the Taylor expansion of a given f by 
computing the derivatives of f and estimating the remainder term. In the 
preceding example, we found the Taylor expansion of arctanx without comput- 
ing its derivatives. This can be done in general, using term-by-term differenti- 
ation or integration or products of power series, when the function in question 
can be written as an integral or derivative or product of functions whose Taylor 
series are known. Here are two more examples of this type. 


268 Chapter 7 Infinite Series of Functions 


7.48 EXAMPLE. 


Find the Maclaurin expansion of arctan x/(1 — x). 


Proof. By Theorem 7.33 and Example 7.47, for each |x| < 1, 


aon = (>*) (Ere) 
( 1—-x a =a 2k +1 


= (poo). 
k=0 eae 
where Ax := {jf EN: 0 <j < (k—1)/2}. a 


7.49 EXAMPLE. 


Show that the Taylor expansion of log x centered at x9 = 1 is 


l mele p* 0,2 
cee 2, k (x — 1) x € (0, 2). 


Proof. By Theorem 7.32, for each x € (0, 2), 


\ _fra_ f* dt 
oer = | rah a= 
oe k 
= yu d ey — 1). a 
[ (l—1)* dt = CD) 


In some situations it is useful to have an integral form of the remainder term. 
This requires a slightly stronger hypothesis than Taylor’s Formula but can yield 
a sharper estimate. 


7.50 Theorem. [LAGRANGE]. 
Letn EN. If f € C"(a, b), then 


R= ee — fo ty f(D at 
(n—1)! Jig 


for all x, xo € (a, b). 


Proof. The proof is by induction on n. If n = 1, the formula holds by the 
Fundamental Theorem of Calculus. 
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Suppose that the formula holds for some n € N. Since 


(n) 
Ran nes” a) (x —xo)" and 
(x = xo)” = 1 fo rae dt, 
n! (n — 1)! Jeo 


it follows that 


1 x 
Rr) = oy f 9! (FM) — C40) at. 


Letu = fMW) _ f™ (xo), dv = (x —t)"—! and integrate the right side of the 
identity above by parts. Since u(xg) = 0 and v(x) = 0, we have 


Rie) =—-—— f wood =— [ @—rpee at. 
(n — 1)! Jeg nM JSxo 


Hence, the formula holds for n + 1. | 


The rest of this section contains some additional (but optional) material on 
analytic functions. 


In order to generalize the Binomial Formula from integer exponents to real 
exponents (compare Theorem 1.26 with Theorem 7.52 below), we introduce the 
following notation. Let a € R and k be a nonnegative integer. The generalized 
binomial coefficient a over k is defined by 


a(a—l1)...(@a—k+1) 


= 

1 k=0. 
Notice that when aw € N, these generalized binomial coefficients coincide with 
the usual binomial coefficients, because in this case 5) =Ofork >a. 


*7.51 Lemma. 
Suppose that a, B € R. Then 
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Proof. The formula holds for k = 0 and k = 1. Ifit holds for some k > 1, then 
by the inductive hypothesis and the definition of the generalized binomial 
coefficients, 


Cee 

Sl) C) aa eee 
ACN O (GEG 
“(SG Ode) 
-¥ (est) @) . 


With this ugly calculation out of the way, we are prepared to generalize the 
Binomial Formula. 


II 
‘M = 


*7.52 Theorem. [THE BINOMIAL SERIES]. 
Ifa € Rand |x| < 1, then 


d+x)*=)> @ xk 


k=0 


In particular, (1 + x)® is analytic on (—1, 1) foralla ER. 


Proof. Fix |x| < 1 and consider the series F(a) := )°729 ({) x*, Since 


is independent of a, it follows from the proof of the Ratio Test that F con- 
verges absolutely and uniformly on R. Hence, F is continuous. Moreover, by 
Theorem 7.33 and Lemma 7.51, 
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()"2 (6) 
ee) G)* 


ue = F(a +). 


F@)F(B) = 


<2 


~ 
ll 
° 


Me 


k 


B 


~ 
ll 
° 


Hence, it follows from Exercise 3.3.9 that F(a) = F(1)%. Since 


CO 
F()) = Ds Ge —1+4+x, 
k=0 
we conclude that F(a) = (1 + x)® for all |x| < 1. | 


Lagrange’s Theorem gives us another condition on the derivatives of f suffi- 
cient to conclude that f is analytic. 


*7.53 Theorem. [BERNSTEIN]. 
If f € C®(a, b) and f™ (x) = 0 for all x € (a,b) and n €N, then f is analytic 
on (a, b). In fact, if x9 € (a, b) and fEOS 0 for x € [xo, b) andn EN, then 


OO Fk) 

fa=y ae (x — xo)! (14) 
k=0 : 

for all x € [xo, b). 


Proof. Fix x9 < x < bandn € N. Use Lagrange’s Theorem and a change of 
variables t = (x — xo)u + xo to write 


— xo)" 
—D! 


R(x) = RE (x) = ik (1—uy"-! f@(( —x9)u+x0) du. (15) 


Since f™ > 0, (15) implies R, (x) > 0. On the other hand, by definition and 
hypothesis, 


Ral) = F(X) ee 2) (5 — say < Fla). 


Therefore, 
0 < Rix) < f() (16) 


for all x € (x0, dD). 
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Let bo € (xo, b) and notice that it suffices to verify (14) for x9 < x < Do. 
(We introduce the parameter bo in order to handle the cases b € R and b = co 
simultaneously.) Since R,(xo) = O for all n € N, we need only show that 
R,(x) > O0asn > oo for each x € (Xx, Do). 

By hypothesis, f@tD(t) > 0 for t € [xo, b), so f™ is increasing on [xo, D). 
Since x < bo < b, we have by (15) and (16) that 


0 < R,(x) = oe (1 —u)"—! f (x — x0)u + x9) du 
_& ae 
~ (n—I)! 


= — *0 
= (= = 2) Rn (bo). 


Since (x — x9)/(bo — xo) < 1 and, by (16), Rn(bo) < f (bo), we conclude by the 
Squeeze Theorem that R,(x) — 0asn > oo. a 


i (1 —u)""! f™ (bp — xo)u + x0) du 


*7,.54 EXAMPLE. 


Prove that a* is analytic on R for eacha > 0. 


Proof. First suppose that a > 1. Since f™ (x) = (oga)" -a* > Oforallx eR 
andn EN, a* is analytic on R by Bernstein’s Theorem. If 0 < a < 1, then by 
what we just proved and a change of variables, 


k 


oo ky -l k oo k 
Si log" (a~*)(—x) log*a-x 
; 1l\-x _ » = y 
nes al lace k! % k! 
k=0 k=0 


Hence by Theorem 7.46, a’ is analytic on R. a 


Our final theorem shows that an analytic function cannot be extended in an 
arbitrary way to produce another analytic function. We first prove the following 
special case. 


*7.55 Lemma. 

Suppose that f, g are analytic on an open interval (c,d) and that xq € (c,d). If 
f(x) = g(x) for x € (c, xq), then there is a5 > 0 such that f(x) = g(x) for all 
x € (xp —6,x9 + 8). 


Proof. By Theorem 7.39 and Definition 7.38, there is a 5 > 0 such that 


o£ (k) 2D" Gh) 
FO) = pO = x) and ge) = EM = xo)* 7) 
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for all x € (xo — 6, x9 + 5). By hypothesis, f, g are continuous at x9 and 


F(x) = tim f(x) = lim _ g(x) = g(x0). (18) 


Similarly, f ® (x9) = g (x9) for k € N. We conclude from (17) that f(x) = 
g(x) for all x € (x9 — 6, x9 + 8). a 


*7,.56 Theorem. [ANALYTIC CONTINUATION]. 

Suppose that I and J are open intervals, that f is analytic on I, that g is analytic 
on J, and thata < bare points in I J. If f(x) = g(x) for x € (a,b), then 
Sf) = g(x) forallx EIN J. 


Proof. We assume for simplicity that J and J are bounded intervals. Since 
IN J #49, choose c,d € Rsuch that 1 1 J = (c, d) (see Figure 7.6). 


\* I al 


| 
| 
| 
| 
| i — 
| 
| 
| 
| 


aes: 


FIGURE 7.6 


Consider the set E = {t € (a,d) : f(x) = g(x) for all x € (a,t)}. By 
our assumption, d < oo and by hypothesis b € E. Thus E is bounded and 
nonempty. Let x9 = sup E. If x9 < d, then by Lemma 7.55 there is ad > 0 
such that f(x) = g(x) for all x € (xo — 6, x9 + 8). This contradicts the choice 
of xo. Therefore, x9 = d; that is, f(x) = g(x) for all x € (a,d). A similar 
argument proves that f(x) = g(x) for all x € (c, b). a 


EXERCISES 
7.4.1. Prove that each of the following functions is analytic on R and find its 
Maclaurin expansion. 
: 9x3 
a) sin(3x) — 3x + os 
b) a" 
c) 1—2sin’x 
2x 1 
d) © 
2x 


7.4.2. Prove that each of the following functions is analytic on (—1, 1) and find 
its Maclaurin expansion. 
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7.4.3. 


7.4.4, 


7.4.5 


7.4.6. 


7.4.7. 


7.4.8. 


Infinite Series of Functions 


e~ 


b) 


l+x 

1 

c) log | —————_ 
Epa areca 

d) arccos x 
For each of the following functions, find its Taylor expansion centered at 
xq = 1 and determine the largest interval on which it converges. 
a) 2e* 
b) log3(x*) 
Orne = 
d) fx 
Prove that if P is a polynomial of degree n and xo € R, then there are 


numbers fx € R such that 
P(x) = Bo + Bi — xo) +--+ + Ba@ — x0)” 
for allx ER. 


Let a > 0 and suppose that f € C°(—a, a). 


a) If f is odd [ie., if f(—x) = —f(x) for all x € (—a,a)], then the 
Maclaurin series of f contains only odd powers of x. 

b) If f is even [ie., if f(—x) = f(x) for all x € (—a,a)], then the 
Maclaurin series of f contains only even powers of x. 


Suppose that f € C°(—oo, oo) and that 


1 a 
lim -| x" f"D(q—x) dx =0 
0 


n>oo n! 


for alla € R. Prove that f is analytic on (—oo, oo) and 


OO ¢(k) 
OE ae ah, xeR. 
k=0 


a) Prove that 
1 n—1 
2 1 3 
d 
ie i do G+ DE =F 


forn EN. 
b) Show that 


1 
9.9253 < / & dx < 2.9254. 
—l 


Let f € C™(a,b). Prove that f is analytic on (a, b) if and only if f’ is 
analytic on (a, D). 
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7.4.9. Suppose that J is a nonempty open interval and that f is bounded and 
C™® on J. If there is an M > 0 such that | f“(x)| < Mk for all x € J 
and all positive integers k sufficiently large, and if there exist a,b € I 
such that 


b 
/ f(x) x" dx =0 


forn =0, 1,2, ..., then prove that f is zero on [a, b]. 
*7.4.10. Suppose that f is analytic on (—oo, oo) and that 


b 
/ If (x)| dx =0 


for some a 4 bin R. Prove that f(x) =O forall x €R. 


*7,4.11. Prove that 
eo) 1/B oo) 
( at) < ola | 
k=1 k=1 


for alla, € Randall 6 > 1. 


*7.5 APPLICATIONS 


This section uses no material from any other enrichment section. 

The theory of infinite series is a potent tool for both pure and applied mathe- 
matics. In this section we give several examples to back up this claim. 

We begin with a nontrivial theorem from number theory. Recall that an 
integer n > 2 is called prime if the only factors of n in N are 1 and n. Also recall 
that given n € N there are primes pj, p2,..., pg and exponents a1, @2,..., a 
such that 


Oy a2 


OK 
N=P, Py ---Py- 


7.57 Theorem. [EUCLID’S THEOREM; EULER’S PROOF]. 
There are infinitely many primes inN. 


Proof. Suppose to the contrary that p1, p2,..., px represent all the primes 


in N. Fix N € N and set a = sup{a,..., ax}, where this supremum is taken 
over all a;’s which satisfy n = Py! De ee a for some n < N. Since every 
integer j € [1, N] must have the form j = De a pe for some choice of 


integers 0 < e; < a, we have 


1 1 1 1 1 1 
1l+—+--+4 ee loge | a ee ae 
P1 P| P2 1) Pk Px 


276 Chapter 7 Infinite Series of Functions 


On the other hand, for each integer i € [1, k], we have by Theorem 6.7 that 


Consequently, 


pts =(=74). (GPa) aus 


j=l 


Taking the limit of this inequality as N — ov, we conclude that a 1/j < 
M < w,acontradiction. | 


Our next application, a result used to approximate roots of twice differen- 
tiable functions, shows that if an initial guess x9 is close enough to a root of a 
suitably well-behaved function f, then the sequence x, generated by (19) con- 
verges to a root of f. 


7.58 Theorem. [NEWTON-RAPHSON]. 
Suppose that f : [a,b] > Ris continuous on [a, b] and that f (c) = 0 for some 
c € (a,b). If f" exists and is bounded on (a, b) and there is an €9 > 0 such 
that | f’(x)| = €0 for all x € (a,b), then there is a closed interval I C (a,b) 
containing c such that given xo € I, the sequence {xn}nen defined by 


Xn-= Xo LGn-) neN, (19) 


Fig) 
Satisfies x, € I and x, > casn > o. 


Proof. Choose M > O such that | f”(x)| < M for x € (a, b). Choose rg € (0, 1) 
so small that J = [c —ro,c + ro] is a subinterval of (a,b) and ro < e9/M. 
Suppose that xo € J and define the sequence {x,} by (19). Set r := roM/eo 
and observe by the choice of rp that r < 1. Thus it suffices to show that 


l%n = cl =F" [xo = cl (20) 
and 


IXn — cl < ro (21) 


hold for alln € N. 
The proof is by induction on n. Clearly, (20) and (21) hold for n = 0. Fix 
n € N and suppose that 


lxn—1 —¢| <r” |x9 — | (22) 
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and that 
IXn—-1 — c| S ro. (23) 
Use Taylor’s Formula to choose a point € between c and x,_; such that 


1 
—f n-1) = fC) — fQn-1) = f' Gn-1)(¢ — Xn-1) + GG — Xn-1). 


Since (19) implies — f (%,-1) = f’(%n-1) (Xn — Xn—1), it follows that 


! I ” 2 
f n-1) On — c) = 5f (6) =an-1)"s 
Solving this equation for x, — c, we have by the choice of M and éo that 


f") 
2 f'(Xn-1) 


Since M/eo < 1/ro, it follows from (24) and (23) that 


M 
2 2 
[Xn —€ -| cre Re 2 roel" (24) 
0 


1 

2, 2 

Xn» —c| < IXn-1 — el < lrol” = ro. 
€0 rO 


This proves (21). Again, by (24), (22), and the choice of r, we have 
M 
ban =e] < =O" "xy = el)? = =O" x9 = cl?) < 7" ax0 = el. 
E€0 rO 


Since r < 1 and 2n — 1 > n imply r-"-! < r", we conclude that |x, —c| < 
7?9lix9 — cl <r" |x0 — cl. Z 


Notice if x,_; and x, satisfy (19), then x, is the x-intercept of the tangent line 
to y = f(x) at the point (x,_1, f(xn_1)) (see Exercise 7.5.4). Thus, Newton’s 
method is based on a simple geometric principle (see Figure 7.7). Also notice 
that, by (24), this method converges very rapidly. Indeed, the number of decimal 
places of accuracy nearly doubles with each successive approximation. 

As a general rule, it is extremely difficult to show that a given nonalgebraic 
number is irrational. The next result shows how to use infinite series to give an 
easy proof that certain kinds of numbers are irrational. 


7.59 Theorem. [EULER]. 
The number e is irrational. 


Proof. Suppose to the contrary that e = p/q for some p,q € N. By Exam- 
ple 7.45, 


[e,2) 


| (-1)* 
Be ae ua 


P k=0 
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AY 


#Y 


FIGURE 7.7 


Breaking this sum into two pieces and multiplying by (—1)?+! p!, we have 


xi=(-1)P*! (10 1)! yore) == 3 (1k 
k! 


k! 
k=0 k=p+l 


Since p!/k! € N for all integers k < p, the number x must be an integer. On 
the other hand, 


1 1 1 
Fel G@PDOtD @FherDOt 


y 


lies between 1/(p + 1) and 1/(p + 1) — 1/(p + 1)(p + 2). Therefore, y is a 
number which satisfies 0 < y < 1. In particular, x 4 y, a contradiction. | 


We know that a continuous function can fail to be differentiable at one point 


[e.g., f(x) = |x|]. Hence, it is not difficult to see that, given any finite set of 
points E, there is a continuous function which fails to be differentiable at every 
point in E. We shall now show that there is a continuous function which fails to 
be differentiable at all points in R. Once again, here is a clear indication that, 
although we use sketches to motivate proofs and to explain results, we cannot 
rely on sketches to give a complete picture of the general situation. 


7.60 Theorem. [WEIERSTRASS]. 
There is a function f continuous on R which is not differentiable at any point 
inR. 

(Note: Such functions are called nowhere differentiable.) 
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Ay 


#Y 


| 
an 
2 


FIGURE 7.8 


Proof. Let 


O<x<1/2 


Xx 
PO=)1_, oe<x<l 


and extend fo to R by periodicity of period 1, that is, so that fo(x) = fox +1) 
for all x € R (see Figure 7.8). Set fx (x) = fo(2kx)/2* forx € Randk € Nand 
consider the function 


fO=) fk), «eR. 
k=0 


Normalizing f; by 2* has two consequences. First, since fo) = +1 for each 
y which satisfies 2y ¢ Z, it is easy to see that 


f/(”) =+1 for each y which satisfies 2+! y ¢ Z. (25) 


Second, by the Weierstrass M-Test, f converges uniformly and, hence, is con- 
tinuous on R. 

Since f is periodic of period 1, it suffices to show that f is not differentiable 
at any x € [0,1). Suppose to the contrary that f is differentiable at some 
x € [0, 1). For eachn € N, choose p € Z such that x € [a,, B,) for a, = p/2” 
and 6B, = (p + 1)/2”. Since each f; is linear on [ag+41, By41] and [an, Ba] © 
[on41, Be4i1] for n > k, it is clear that 


Sk (Bn) — fe(@n) 
CE = 


Bn — On 


depends only on k and not onn whenn > k. Moreover, by (25), it is also clear 
that each c, = +1. Therefore, per cz cannot be convergent. 
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On the other hand, since f is differentiable at x, 


f(x) esgies F (Bn) — fn) 


1 
Te n — An 


(26) 


(see Exercise 7.5.7). However, since fo(y) = 0 if and only if y € Z, we also 
have fx(Bn) = fx(Qn) = 0 for k > n. It follows that f(6,) = ye, Sk (Bn) and 
fn) = we, fk (Qn). We conclude from (26) that 


foe) n-1 


—fa 
ye Slim ee =i eC ee 
= n> oo ram noo Bn — On 
is convergent, a contradiction. a 


EXERCISES 


7.5.1. Using a calculator and Theorem 7.58, approximate all real roots of f(x) = 
2x3 + x? + 4x + 1 to seven decimal places. 
7.5.2. a) Using the proof of Theorem 7.58, prove that (20) holds if r/2 
replaces r. 

Use part a) to estimate the difference |x4—z|, where x9 = 3, f(x) = 
sinx, and x, is defined by (19). Evaluate x4 directly, and verify that x4 
is actually closer than the theory predicts. 

7.5.3. Prove that given any n € N, there is a function f € C”(R) such that 
f+ (x) does not exist for any x € R. 

7.5.4. Prove that if x,_1, x, satisfy (19), then x, is the x-intercept of the tangent 
line to y = f(x) at the point (x1, f(%n_-1)). 

7.5.5. Prove that sin(1) is irrational. 

7.5.6. Suppose that f : R > R. If f” exists and is bounded on R, and there is 
an €9 > 0 such that | f’(x)| > €o for all x € R, prove that there exists a 
56 > 0 such that if | f(xo)| < 6 for some xg € R, then f has a root; that is, 
that f(c) = 0 forsomec eR. 

7.5.7. Let x € [0,1) anda, B, be defined as in Theorem 7.60. 


a) If f : (0,1) > Rand y ER, prove that 


f (Bn) = f@n) _ -(een )\(F =) 
Bn — On po Bn — x ‘ Bn — On 


+(e y) (=*+). 
X — An Bn — On 


b) If f is differentiable at x, prove that (26) holds. 


CHAPTER 8 


Euclidean Spaces 


The world we live in is at least four dimensional: three spatial dimensions 
together with the time dimension. Moreover, certain problems from engineer- 
ing, physics, chemistry, and economics force us to consider even higher dimen- 
sions. For example, guidance systems for missiles frequently require as many 
as 100 variables (longitude, latitude, altitude, velocity, time after launch, pitch, 
yaw, fuel on board, etc.). Another example, the state of a gas in a closed 
container, can best be described by a function of 6m variables, where m is 
the number of molecules in the system. (Six enters the picture because each 
molecule of gas is described by three space variables and three momentum vari- 
ables.) Thus, there are practical reasons for studying functions of more than one 
variable. 


8.1 ALGEBRAIC STRUCTURE 


For each n € N, let R” denote the n-fold cartesian product of R with itself; 
that is, 


RY? =( 45 X0,0305 9) 3.7 Rifor fH 1,2,5 5.0}: 


By a Euclidean space we shall mean R” together with the “Euclidean inner prod- 
uct” defined in Definition 8.1 below. The integer n is called the dimension of 
R”, elements x = (xj, X2,...,%X,) of R” are called points or vectors or ordered 
n-tuples, and the numbers x; are called coordinates, or components, of x. Two 
vectors x, y are said to be equal if and only if their components are equal; 
that is, if and only if x; = y; for j = 1,2,...,n. The zero vector is the vec- 
tor whose components are all zero; that is, 0 := (0,0,...,0). Whenn = 2 
(respectively, n = 3), we usually denote the components of x by x, y (respec- 
tively, by x, y, z). 

You have already encountered the sets R” for small n. R' = R is the real line; 
we shall call its elements scalars. R? is the xy-plane used to graph functions of 
the form y = f(x). And R? is the xyz-space used to graph functions of the form 
z= f(, y). 

We have called elements of R” points and vectors. In general, we make no 
distinction between points and vectors, but in each situation we adopt the inter- 
pretation which proves most useful. 

In earlier courses, vectors were (most likely) directed line segments, but our 
vectors look like points in R”. What is going on? When we call ana € R"” a 
vector, we are thinking of the directed line segment which starts at the origin 
and ends at the point a. 
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What about directed line segments which begin at arbitrary points? Two 
arbitrary directed line segments are said to be equivalent if and only if they 
have the same length and same direction. Thus every directed line segment 
V is equivalent to a directed line segment in standard position; that is, one 
which points in the same direction as V, has the same length as V, but whose 
“tail” sits at the origin and whose “head,” a, is a point in R”. If we identify V 
with a, then we can represent any arbitrary directed line segment in R” by a 
point in R”. 

Identifying arbitrary vectors in R” with vectors in standard position and, in 
turn, with points in R” may sound confusing and sloppy, but it is no different 
from letting 1/2 represent 2/4, 3/6, 4/8, and so on. (In both cases, there is an 
underlying equivalence relation, and we are using one member of an equiva- 
lence class to represent all of its members. For vectors, we are using the repre- 
sentative which lies in standard position; for rational numbers, we are using the 
representative which is in reduced form.) 

We began our study of functions of one variable by examining the algebraic 
structure of R. In this section we begin our study of functions of several variables 
by examining the algebraic structure of R”. That structure is described in the 
following definition. 


8.1 Definition. 


Let x = (%1,...,4n), Y= O1,---, yn) € R", anda ER. 
i) The sum of the vectors x and y is the vector 


X+y = t+ y1,%2 + y2,---5%n + Yn). 
ii) The difference of the vectors x and y is the vector 
X—Y := (1 — y1,X2 — y2,---,Xn — Yn)- 
iii) The product of the scalar aw and the vector x is the vector 
QX := (AX1,AX2,...,AXy). 


iv) The (Euclidean) dot product (or scalar product or inner product) of the 
vectors x and y is the scalar 


XY (= X11 + X22 + +++ +XnyYn. 


These algebraic operations are analogues of addition, subtraction, and multi- 
plication on R. It is natural to ask, Do the usual laws of algebra hold in R”? An 
answer to this question is contained in the following result. 
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8.2 Theorem. Letx, y,z € R” anda, B € R. Then 


a0=0, Ox=0, O0-x=0, Ix=x, 04+x=x, x-x=0, 
a (Bx) = B(ax) = (aB)x, a(x-y) = (@x)-y=x- (ay), 
xty=yt+x, xt+(y+z)=(k+y)+z, x-y=y-x, 
a(xt+y)=ax+ay, and x-(y+z)=x-y+x-z. 


Proof. These properties are direct consequences of Definition 8.1 and corre- 
sponding properties of real numbers. We will prove that vector addition is 
associative, and leave the proof of the rest of these properties as an exercise. 

By definition and associativity of addition on R (see Postulate 1 in 
Section 1.2), 


x+(y+z) = (1,...,%) + O1+2Z1,---.¥n + Zn) 
= (41 + O1 +21), +--+ Xn + On + 2n)) 
= (41 + y1) + 21,---, On + Yn) + Zn) = &K&+Y) +2. a 


Thus (with the exception of the closure of the dot product and the existence 
of the multiplicative identity and multiplicative inverses), R” satisfies the same 
algebraic laws, listed in Postulate 1, that R does. This means one can use instincts 
developed in high school algebra to compute with these vector operations. For 
example, just as (x — y)* = x*—2xy+y* holds for real numbers x and y, even so, 


(x—y)-(k—y) =x-x-—2x-y+y-y (1) 


holds for any vectors x, y € R”. 

In the first four chapters, we used algebra together with the absolute value to 
define convergence of sequences and functions in R. Is there an analogue of the 
absolute value for R”? The following definition illustrates the fact that there are 
many such analogues. 


8.3 Definition. 
Let x € R”. 


i) The (Euclidean) norm (or magnitude) of x is the scalar 


n 
x= | 0 lxel?. 
k=1 


ii) The ¢'-norm (read L-one-norm) of x is the scalar 


n 
xllis= >— lxel. 
k=1 
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8.3 Definition. (Continued) 


iti) The sup-norm of x is the scalar 
IXloo:= max{|x1|,..., |Xnl}- 
iv) The (Euclidean) distance between two points a, b € R” is the scalar 


dist(a, b) := |la — bl]. 


(Note: For relationships between these three norms, see Remark 8.7 below. The 
subscript oo is frequently used for supremum norms because the supremum of 
a continuous function on an interval [a, b] can be computed by taking the limit 


of ai | f (x)|Pdx)!/? as p > oo—see Exercise 5.2.8.) 


Since ||x|| = |lx|ly = |Ilxllo = |x|, when n = 1, each norm defined above is an 
extension of the absolute value from R to R”. The most important, and in some 
senses the most natural, of these norms is the Euclidean norm. This is true for 
at least two reasons. First, by definition, 


Ix? =x-x forall xeR". 


(This aids in many calculations; see, for example, the proofs of Theorems 8.5 
and 8.6 below.) Second, if A is the triangle in R* with vertices (0, 0), x := (a, b), 
and (a, 0), then by the Pythagorean Theorem, the hypotenuse of A, Va? + b?, 
is exactly the norm of x. In particular, the Euclidean norm of a vector has a 
simple geometric interpretation in R?. 

The algebraic structure of R” also has a simple geometric interpretation in 
R’ which gives us another very useful way to think about vectors. To describe 
it, fix vectors a = (a), a2) and b = (by, bz) and let P(a, b) denote parallelogram 
associated with a and b (i.e., the parallelogram whose sides are given by aandb). 
(We are assuming that this parallelogram is not degenerate—see Figure 8.1.) 
Then the vector sum of a and b, (a; + bj, a2 + bz), is evidently the diagonal 
of P(a, b); that is, a + b is the vector which begins at the origin and ends at 
the opposite vertex of P(a, b). Similarly, the difference a — b can be identified 
with the other diagonal of P(a, b) (see Figure 8.1). The scalar product of t and 
a, (fa), ta2), evidently stretches or compresses the vector a, but leaves it in the 
same Straight line which passes through 0 and a. Indeed, if tf > 0, then ta has 
the same direction as a, but its magnitude, |t| ||al|, is > or < the magnitude of 
a, depending on whether tf > 1 ort < 1. When ¢ is negative, ta points in the 
opposite direction from a but is again stretched or compressed depending on 
the size of |t|. 


Section 8.1 Algebraic Structure 285 


#Y 


FIGURE 8.1 


Using R? as a guide, we can extend concepts from R* to R". Here are five 


examples. 


1) Every (a, b) € R* can be written as (a, b) = a(1, 0) + b(0, 1). Using this as a 


2 


3 


— 


— 


guide, we define the usual basis of R” to be the collection {e;,..., e,}, where 
e; is the point in R” whose jth coordinate is 1, and all other coordinates 
are 0. Notice by definition that each x = (%1,..., x,) € R” can be written as 
a linear combination of the e;’s: 


n 
x= > xe). 
=! 


We shall not discuss other bases of R” or the more general concept of “vec- 
tor spaces,” which can be introduced using postulates similar in spirit to 
Postulate 1 in Chapter 1. Instead, we have introduced just enough alge- 
braic machinery in R” to develop the calculus of multivariable functions. 
For more information about R” and abstract vector spaces, see Noble and 
Daniel [9]. 

Note: In R? or R?, e; is denoted by i, e2 is denoted by j, and, in R?, 3 is 
denoted by k. Thus, in R?, i:= (1, 0,0), j := (0, 1, 0), and k := (0, 0, 1). 
Let t € Rand a,b € R? with b nonzero. By the geometric interpretation of 
vector addition, (ft) := a+ tb is a point on the line passing through a in the 
direction of b. Using this as a guide, we define the straight line in R” which 
passes through a point a € R” in the direction b € R"\{0} to be the set of 
points 


fa(b) := {a+ tb: t € R}. 


In particular, it is easy to see that the parallelogram P(a, b) determined by 
nonzero vectors a and b in R” can be described as 


P(a; b) := {ua+ vb: u, v € [0, 1]}. 


Fix a¥b in R’, and set y(t):=(1 —t)a+tb, for t € R. Since y(t) =a+t(b—a), 
it is evident that y describes the line £,(b — a). This line passes through the 
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points y(0) = a and y(1) = b. In fact, by the geometric interpretation 
of vector subtraction, as ¢ ranges from 0 to 1, the points w(t) trace out the 
diagonal of P(a, b) that does not contain the origin (see Figure 8.1). It begins 
at a and ends at b. Using this as a guide, we define the line segment from 
a é€ R” tob € R’ to be the set of points 


L(a:b) := {1 —fat+tb:t € (0, 1]}. 


4) The angle between two nonzero vectors a, b € R? can be computed by the 
following process. If A is the triangle determined by the points 0, a, and b, 
then the sides of A have length ||a||, ||b||, and ||a—bj|. If we let 6 be the angle 
between a and b [i.e., the angle in A at the vertex (0, 0)], then by the Law of 
Cosines (see Appendix B), 


lla — bl]? = |lall? + ||b||? — 2\Jall || bl] cos 4. 


Since Theorem 8.2 implies ||a — b||* = (a—b) - (a—b) = |lal|* — 2a-b + ||b||’, 
it follows that —2a-b = —2|lall ||b|| cos@. Since neither a nor b is zero, we 
conclude that 
a-b 
d= ; 2 
\|a]| ||| © 


Using this as a guide, we define the angle between two nonzero vectors 
a,b € R” (for any n € N) to be the number 6 ¢€ [0, 7] determined by (2). 
(Our next result, the Cauchy-Schwarz Inequality, shows that the right side 
of (2) always belongs to the interval [—1, 1]. Hence, for each pair of nonzero 
vectors a, b € R”, there is a unique angle 0 € [0, 2] which satisfies (2).) 

5) Two vectors in R? are parallel when one is a multiple of the other, and 
orthogonal when the angle, 6, between them is 7/2; that is, when a-b = 
cos @||al| ||b|| = 0. Using this as a guide, we make the following definition 
in R". 


8.4 Definition. 
Let a and b be nonzero vectors in R”. 


i) aand bare said to be parallel if and only if there is a scalar t € R such that 
a= tb. 
ii) aand bare said to be orthogonal if and only ifa-b = 0. 


Notice that the usual basis {e;} consists of pairwise orthogonal vectors; that 
is, e; -e, =O when j #k. In particular, the usual basis is an orthogonal basis. 

We note in passing that Definition 8.4 is consistent with formula (2)—see 
Exercise 8.1.4b. Indeed, if 6 is the angle between two nonzero vectors a and 
b in R”, then a and b are parallel if and only if 6 = 0 or 6 = z, and a and b are 
orthogonal if and only if 6 = 7/2. 

We shall see below that in addition to suggesting definitions for R”, the geom- 
etry of R’ can also be used to help suggest proof strategies in R”. 
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Let’s return to the analogy between R and R”. Surely, if we are going to 
develop a calculus of several variables, we need to know more about the 
Euclidean norm on R”. The next two results answer the question, How many 
properties do the absolute value and the Euclidean norm share? 

Although the norm is not multiplicative, the following fundamental inequality 
can be used as a replacement for the multiplicative property in most proofs. 
(Some authors call this the Cauchy-Schwarz—Bunyakovsky Inequality.) 


8.5 Theorem. [CAUCHY-SCHWARZ INEQUALITY]. 
If x, y € R", then 


Ix-y| < [Ixll llyll. 


STRATEGY: Using the fact that the dot product of a vector with itself is the 
square of the norm of the vector and the square of any real number is nonnega- 
tive, identity (1) becomes 0 < ||x—y||? = ||x||? —2x-y+ |ly||?. We could solve this 
inequality to get an estimate of the dot product of x - y, but this estimate might 
be very crude if ||x — y|| were much larger than zero. But x — y is only one point 
on the line ¢,(y). We might get a better estimate of the dot product x-y by using 
the inequality 


0 < |x — ty]? = (x — ty) - (K—ty) = IIx? — 2t(x-y) +: 7 lly? (3) 


for other values of t. In fact, if we draw a picture in R* (see Figure 8.2), we 
see that the norm of ||x — ty|| is smallest for the value of t which makes x — ty 
orthogonal to y; that is, when 


O=(x—-ty)-y=x-y—ty-y=x-y—‘tllyll’. 


This suggests using t = x - y/|ly|I? when y ¥ 0. It turns out that this value of t is 
exactly the one which reproduces the Cauchy—Schwarz Inequality. Here are the 
details. 


#Y 


FIGURE 8.2 
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Proof. The Cauchy—Schwarz Inequality is trivial when y = 0. If y £ 0, substi- 
tute t = (x - y)/|ly||? into (3) to obtain 


0 < IIxll* — t(x-y) = IJxll* — 


It follows that 0 < ||x||? — (x - y)7/|ly||?. Solving this inequality for (x - y)*, we 
conclude that 


(x+y)? < [IxI7Ilyll’. | 


The analogy between the absolute value and the Euclidean norm is further 
reinforced by the following result (compare with Theorem 1.7). (See also Exer- 
cise 8.1.10.) 


8.6 Theorem. Let x, y € R". Then 


i) ||x|| => 0 with equality only when x = 0, 
ii) ||ax|| = |a|||x|| for all scalars a, 
iti) [TRIANGLE INEQUALITIES]. ||x + yl] < ||x|| + |lyl| avd |lx — yl] = [xl] — lly. 


Proof. Statements i) and ii) are easy to verify. 
To prove iii), observe that by Definition 8.3, Theorem 8.2, and the Cauchy— 
Schwarz Inequality, 


Ix+yll? =(e+y)-(K+y) =x-x+2x-yty-y 
= |x|]? + 2x- yt Iyll? < [xi]? + 20x lly + lly? = Cxll + Ilyl)?. 


This establishes the first inequality in iii). By modifying the proof of Theorem 
1.7, we can also establish the second inequality in iii). a 


Notice that the Triangle Inequality has a simple geometric interpretation. 
Indeed, since ||x|| is the magnitude of the vector x, the inequality ||x + y|| < 
|x| + |ly|| states that the length of one side of a triangle (namely, the triangle 
whose vertices are 0, x, and x + y) is less than or equal to the sum of the lengths 
of its other two sides. 

For some estimates, it is convenient to relate the Euclidean norm to the 
¢!-norm and the sup-norm. 


8.7 Remark. Let x € R”. Then 


i) ||Xlloo < IIxll < V/7||Xlloo, and 
ii) [xl] < [Ixlli < J/n|xl. 
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Proof. i) Let 1 < j <n. By definition, 
2 
Ixj/? < |x|? =xf +--- +22 <n ( max jn) =n |Ix\l2; 
1<f<n 


that is, |x;| < ||x|| and ||x|| < /n||x\loo. Taking the supremum of the first of 
these inequalities, over all 1 < j <n, we also have ||x||oo < ||x'l. 


ii) Let A = {(@i, 7): 1 <i, j <nandi < j}. To verify the first inequality, 
observe by algebra that 


n 
itt = (Shs) = Ds? +2 D> [xl lxjl =x? +2 D> [xl byl. 
i=l 


(j)EA (i, jJeA 


Since aNEA |x; ||x;| = 0, it follows that \[x||2 < IIx|l7. 
On the other hand, 


o< (el — yD? = Ve Dil? —2 © Ix lxjl 


(i, j)eA i=l G,jyeA 
= n||x|° — y xi? +2 S© [xl xy] | = albxll? — [xtlt. 
(@,j)eA 
This proves the second inequality. | 


Since x-y is a scalar, the dot product in R” does not satisfy the closure property 
for any n > 1. Here is another product, defined only on R?, which does satisfy 
the closure property. (As we shall see below, this product allows us to exploit 
the geometry of R? in several unique ways.) 


8.8 Definition. 


The cross product of two vectors x = (x1, x2, x3) and y = (y, y2, y3) in R? is 
the vector defined by 


XX Y := (x2y3 — X3Y2, X31 — X13, X12 — X2)1). 


Using the usual basis i = e1, j = e2, k = e3, and the determinant oper- 
ator (see Appendix C), we can give the cross product a more easily remem- 
bered form: 


ij k 
xx y=det| x, x2 x3 
Yt Y2 Y3 
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The following result shows that the cross product satisfies some, but not all, 
of the usual laws of algebra. (Specifically, notice that although the cross product 
satisfies the distributive property, it satisfies neither the commutative property 
nor the associative property.) 


8.9 Theorem. Let x,y,z € R? be vectors and a be a scalar. Then 
i)xxx=0, xxy=-yxx, 

ii) (ax) xX y=a(x x y) = x x (ay), 

iii) xx (yt+ya=(kKxy+(xa), 


X1 x2 X3 
iv) (kx y)-zZ=x-(yxz)=det]/y1 yo y3], 
Zl 22 23 


v) xx (y XZ) = (x-Zy — (x-y)z, 
and 
vi) |x x yl? = («-x)\(y-y) — (x-y)’. 


vii) Moreover, if x x y £ 0, then the vector x x y is orthogonal to x and y. 


Proof. These properties follow immediately from the definitions. We will 
prove properties iv), v), and vii) and leave the rest as an exercise. 
iv) Notice that by definition, 


(XX y) -Z= (x23 — x3 y2)Z1 + (X3Y1 — X1Y3)Z2 + (XL Y2 — X291)z3 
= x1 (y223 — y3Z2) + x2(93Z1 — y1z3) + .x3(9122 — 221). 


Since this last expression is both the scalar x-(y x z) and the value of the deter- 
minant on the right side of iv) (expanded along the first row), this verifies iv). 


v) Since x x (y x Z) = (x1, 2,.x3) X (y2z3 — 322, ¥3Z1 — Y1Z3, Y1Z2 — 221), 
the first component of x x (y x z) is 


X21 Z2 —X2V2Z] — XZ V3Z1 ANZ Z3 = (41 Z1 +4222 +4323) V1 — (X11 +X2V24+X33)Z1- 


This proves that the first components of x x (y x z) and (x- z)y — (x- y)z are 
equal. A similar argument shows that the second and third components are 
also equal. 

vii) By parts i) and iv), (x x y)-x = —(y x x)-x = —-y- (xxx) =-y-0=0. 
Thus x x y is orthogonal to x. A similar calculation shows that x x y is orthog- 
onal to y. a 


Part vii) is illustrated in Figure 8.3. Notice that x x y satisfies the “right-hand” 
rule. Indeed, if one puts the fingers of the right hand along x and the palm of 
the right hand along y, then the thumb points in the direction of x x y. 
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FIGURE 8.3 


By (2), there is a close connection between dot products and cosines. The 
following result shows that there is a similar connection between cross products 
and sines. 


8.10 Remark. Let x, y be nonzero vectors in R? and 6 be the angle between x and 
y. Then 


Ix x yl] = [xi llyll sind. 


Proof. By Theorem 8.9vi and (2), 


IIx x yll? = (iIxll yl)” — Cx llyll cos 4)? 
= (Ixil llyl])?C. — cos” 6) = (([xll llyll)* sin? 4. = 


This observation can be used to establish a connection between cross products 
and area or volume (see Exercise 8.2.7). 


EXERCISES 
8.1.1. Let x, y,z € R”. 


a) If ||x —z|| < 4 and |ly —z|| < 2, prove that ||x — y|| < 6. 

b) If ||x|| < 3, |lyll < 4, and ||z|| < 2, prove that |x-y —x-z| < 18. 

c) If |x — y|| < 7 and ||z|| < 2, prove that |x - (y —z) —y- (x—z)| < 14. 
d) If ||2x — yl] < 4 and |ly|| < 1, prove that |||x — yl? —x- x] < 4. 

e) Ifn =3, ||x — yl] < 4, and ||z|| < 2, prove that ||x x z—y x z|| < 8. 
f) Ifn = 3, ||xl| < 1, |lyl| < 2, and ||z|| < 1, prove that ||x- (y x z)|| < 2. 
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8.1.2. 


8.1.3. 


8.1.4. 


8.1.5. 


8.1.6. 


8.1.7. 


8.1.8. 


Euclidean Spaces 


Let B := {x € R”: ||x|] < 1}. 
a) Ifa,b,c € B and 
ne (a-b)e+ (a-c)b+(c-b)a 
—_— 3 ’ 


prove that v belongs to B. 
b) Ifa, b € B, prove that 


ja-c—b-d| < ||b—cel| + lla—d|l 


for alle,d € R”. 
c) Ifa, b,c € B andn = 3, prove that 


Vla- (b xo)? + lab? <1. 


Use the proof of Theorem 8.5 to show that equality in the Cauchy— 
Schwarz Inequality holds if and only if x = 0, y = 0, or x is parallel 
to y. 

Let a and b be nonzero vectors in R”. 


a) If d(t) = a+ tb fort € R, show that for each f, 1}, t2 € R with 
ti, 2 # to, the angle between $(t1) — d(to) and o(f2) — d(fo) is O or z. 

b) If @ is the angle between a and b, show that a and b are parallel 
according to Definition 8.4 if and only if 6 = 0 or z, and that a and b 
are orthogonal according to Definition 8.4 if and only if 90 = 7/2. 


The midpoint of a side of a triangle in R? is the point that bisects that 
side (i.e., that divides it into two equal pieces). Let A be a triangle in R* 
with sides A, B, and C and let L denote the line segment between the 
midpoints of A and B. Prove that L is parallel to C and that the length 
of L is one-half the length of C. 


a) Prove that (1, 2, 3), (4, 4, 2), and (0, 5, 6) are vertices of a right triangle 
in R?. 

b) Find all nonzero vectors orthogonal to (2,0,—1) which lie in the 
plane z= y. 

c) Find all nonzero vectors orthogonal to the vector (—3, 4,2) whose 
components sum to 8. 


Let a < b be real numbers. The Cartesian product [a,b] x [a, b] is 
obviously a square in R?. Define a cube Q in R” to be the n-fold Carte- 
sian product of [a, b] with itself; that is, Q := [a,b] x --- x [a,b]. Find 
a formula of the angle between the longest diagonal of Q and any of 
its edges. Show that when n = 3, this angle is approximately 54.74 
degrees. 

a) Using Postulate 1 in Section 1.2 and Definition 8.1, prove Theo- 

rem 8.2. 
b) Prove Theorem 8.9, parts i) through iii) and vi). 
c) Prove that if x, y € R®, then ||x x y|| < [|x| lly]. 
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8.1.9. Suppose that {a;,} and {by} are sequences of real numbers which satisfy 


[o, 2) [o,2) 
Soar <oo and Sibi <O. 
k=1 k=1 


Prove that the infinite series }°?° , axb, converges absolutely. 
8.1.10. Prove that the ¢'-norm and the sup-norm also satisfy Theorem 8.6. 


8.2. PLANES AND LINEAR TRANSFORMATIONS 


A plane J7 in R? is a set of points that is “flat” in some sense. What do we mean 
by flat? Any vector that lies in J7 is orthogonal to a common direction, called 
the normal, which we will denote by b. Fix a point a € /7. Since the vector 
x —a lies in /7 for all x € [7 and since two vectors are orthogonal when their dot 
product is zero, we see that (x — a) -b = 0 for all x € JT (see Figure 8.4). 

Using this three-dimensional case as a guide, for any a, b € R” with b 4 0, we 
call the set 


TTy(a):={x € R” : (x—a)-b =0} 


the hyperplane in R” passing through a point a € R” with normal b. (We call it a 
plane when n = 3.) In particular, [7p(a) is the set of all points x such that x — a 
is orthogonal to b. 

There is nothing unique about “the normal” of a hyperplane: Any nonzero 
vector ¢ parallel to b will define the same hyperplane. Indeed, if b and ¢ are 
parallel, then, by definition, b = te for some nonzero t € R; hence (x—a)-b = 0 
if and only if (x —a)-c = 0. Nevertheless, many properties of hyperplane can be 


Ab 


II,(a) 


FIGURE 8.4 
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determined by their normals. For example, the angle between two hyperplanes 
with respective normals b and c is defined to be the angle between the normals 
bande. 

By an equation of a hyperplane JT we mean an expression of the form 
F(x) = 0, where F : R” — Ris a function determined by the following prop- 
erty: A point x belongs to /7 if and only if F(x) = 0. By definition, then, an 
equation of the hyperplane J7p (a) [i.e., the hyperplane passing through the point 
a= (q1,...,4,) with normal b = (1, ..., b,)] is given by 


b-x=b.-a. 


This form is sometimes referred to as the point-normal form. It can also be 
written in the form 


bixy + b2x2 +--+ + dnxn = d, 


where d = bya, + boa2 +--+ + byay is a constant determined by a and b (and 
related to the distance from /7p(a) to the origin—see Exercise 8.2.8). In particu- 
lar, planes in R* have equations of the form 


ax +by+cez=d. 


Notice that a “hyperplane” in R? is by definition a straight line. Just as straight 
lines through the origin played a prominent role in characterizing differentia- 
bility of functions of one variable (see Theorem 4.3), even so hyperplane-like 
objects will play a crucial role in defining differentiability of functions of several 
variables. Why hyperplane-like objects and not just hyperplanes themselves? 
Equations of hyperplanes are by definition real valued and we do not want to 
restrict our analysis of differentiable functions to the real-valued case. 

What kind of hyperplane-like objects will be rich enough to develop a general 
theory for differentiability of vector-valued functions? To answer this question, 
we make the following observation about equations of straight lines through the 
origin. (Here we use s for slope since m will be used for the dimension of the 
range space R”.) 


8.11 Remark. Let T : R — R. Then T(x) = sx for some s € Rif and only if T 
satisfies 

T(x+y)=T(x)+TQ) and T(ax) =aT(x) (4) 
forallx,y,a ER. 


Proof. If T(x) = sx, then T satisfies (4) since the distributive and commu- 
tative laws hold on R. Conversely, if T satisfies (4), set s := T(1). Then (let 
a=x), 


T(x) =T(x-1)=xT() =sx 
for allx ER. a 
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Accordingly, we introduce the following concept. 


8.12 Definition. 


A function T : R" — R" is said to be linear [notation: T ¢ £(R"; R”)] if and 
only if it satisfies 


T(xx+y)=T(x)+T(y) and Tax) =aT(x) 


for all x, y € R” and all scalars a. 


When m = 1 (i.e., when the range of T is R), we shall often drop the boldface 
notation (i.e., write T for T). 
Notice once and for all that if T is a linear function, then 


T(0) = 0. (5) 


Indeed, by definition, T(0) = T(0+0) = T(0) +T(0). Hence (5) can be obtained 
by subtracting T(0) from both sides of this last equation. Also notice that if 
F (x) = Ois the equation of a hyperplane passing through the origin, then F(x) = 
ayx, +--+ + anXn. In particular, F ¢ £(R"; R). 

Functions in £(R”; R”) are sometimes called linear transformations or linear 
operators because of the fundamental role they play in the theory of change of 
variables in R”. We shall take up this connection in Chapter 12. 

According to Remark 8.11, linear transformations of one variable [i.e., objects 
T € L(R; R)] can be identified with R by representing T by its slope s. Is there 
an analogue of slope which can be used to represent linear transformations of 
several variables? To answer this question, we use the following half page to 
review some elementary linear algebra. 

Recall that an m x n matrix B is a rectangular array which has m rows and n 
columns: 


by, bin +++ Diy 

bo, ban +++ ban 
B= [bij Imxnt= : . a . 

bin bm2 ae bmn 


For us, the entries b;; of a matrix B will usually be numbers or real-valued func- 
tions. Let B = [bijlmxn and C = [cyx]pxq be such matrices. Recall that the 
product of B and a scalar a is defined by 


aB= labij|mxn, 
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the sum of B and C is defined (when m = p and n = q) by 
B+C = [bij + cij|mxn; 


and the product of B and C is defined (when n = p) by 


n 
BC = De bc 
v=1 


mxq 


Also recall that most of the usual laws of algebra hold for addition and multipli- 
cation of matrices (see Theorem C.1 in Appendix C). One glaring exception is 
that matrix multiplication is not commutative. 

We shall identify points x = (41, .x2,...,%n) € R” with 1 x n row matrices or 
n x 1 column matrices by setting 


| 
xn) or [x]=[x1 x2 .-. Xn] := - ‘ 
bal 


where B’ represents the transpose of a matrix B (see Appendix C). Abusing 
the notation slightly, we shall usually represent the product of an m x n matrix B 
and ann x | column matrix [x] by Bx. This notation is justified, as the following 
result shows, since the function x + ~ [x] takes vector addition to matrix addi- 
tion, the dot product to matrix multiplication, and scalar multiplication to scalar 
multiplication. 


[x] =[x1 x2 


8.13 Remark. /fx, y € R” and a is a scalar, then 
Ix + y] = [x] + Ly], [x-y] =[xlly]’, and [ox] = [x]. 


Proof. These laws follow immediately from the definitions of addition and 
multiplication of matrices and vectors. For example, 


Ixt+y]=[x1+y1 x2 +92 ... Xn t+ Yn] 
=[x1 x2 ... Xlt+ ty yo --- Yn] = [x] + [yl]. a 


The following result shows that each m xn matrix gives rise to a linear function 
from R” to R”. 


8.14 Remark. Let B = [b;;] be anm x n matrix whose entries are real numbers 
and let e,..., &, represent the usual basis of R". If 


T(x) = Bx, x ER’, (6) 
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then T is a linear function from R” to R™ and the jth column of B can be obtained 
by evaluating T at e;: 


(b1;, boj,..-, mj) = T(e;), PHN Deco: (7) 


Proof. Notice, first, that (7) holds by (6) and the definition of matrix multi- 
plication. Next, observe by Remark 8.13 and the distributive law of matrix 
multiplication (see Theorem C.1) that 


T(x+y) = B[x+y] = B(x] + [y]) = Bix] + Bly] = T@) + Tty) 


for all x, y € R”. Similarly, T(@x) = B[ax] = B(a[x]) = aB[x] = aT(x) for all 
x € R” anda ER. Thus T € £(R"; R”). is] 


Remark 8.14 would barely be worth mentioning were it not the case that ALL 
linear functions from R” to R” have this form. Here, then, is the multidimen- 
sional analogue of Remark 8.11. 


8.15 Theorem. For each T € £(R"; R”) there is a matrix B = [bjj|mxn Such that 
(6) holds. Moreover, the matrix B is unique. Specifically, for each fixed T there is 
only one B which satisfies (6), and the columns of B are defined by (7). 


Proof. Uniqueness has been established in Remark 8.14. To prove existence, 
suppose that T € £(R”; R”). Define B by (7). Then 


n 
T@) =T| > xe; 
j=l 


n n 
>| xjTEe;) = Si Cig bahay) 
j=l j=l 


n n n 
S xybiy, > xpboy.-+-5 > xpbny | = BX. a 
j=l j=l j=l 


The unique matrix B which satisfies (6) is called the matrix which represents T. 


In Chapter 11 we shall use this point of view to define what it means for a 
function from R” into R” to be differentiable. At that point, we shall show that 
many of the one-dimensional results about differentiation remain valid in the 
multidimensional setting. Since the one-dimensional theory relied on estimates 
using the absolute value of various functions, we expect the theory in R” to rely 
on estimates using the norms of various functions. Since some of those functions 
will be linear, the following concept will be useful in this regard. 
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8.16 Definition. 


Let T € £(R"; R”). The operator norm of T is the extended real number 


T(x) || 
|T|| = sup 
x40 XII 


One interesting corollary of Theorem 8.15 is that the operator norm ofa linear 
function is always finite. 


8.17 Theorem. Let T € £(R"; R”). Then the operator norm of T is finite and 
satisfies 


ITO] < ITI [xl (8) 
forallx € R". 


Proof. Since T(0) = 0, (8) holds for x = 0. On the other hand, by Defi- 
nition 8.16, (8) holds for x 4 0. It remains to prove that the extended real 
number ||T|| is finite. 

Let B be the m x n matrix which represents T, and suppose that the rows of 
T are given by b;,..., b,,. By the definition of matrix multiplication and our 
identification of R” with m x 1 matrices, 


T(x) = (b} -x,..., by - x). 


If B = O, then ||T|| = 0 and (8) is an equality. If B 4 O, then, by the Cauchy— 
Schwarz Inequality, the square of the Euclidean norm of T(x) satisfies 


T(x) ||? = (by x)? +--+ Dm © x)? 
< (by |] xi)? + =~ > + (bye xl)? 


<m-max{||bj||? :1 <j <m}|Ix||? =: C |[xll’. 


Therefore, the quotients ||T(x)||/||x|| are bounded (by /C). It follows from 
the Completeness Axiom that |/T|| exists and is finite. | 


Theorem 8.17, an analogue of the Cauchy—Schwarz Inequality, will be used to 
estimate differentiable functions of several variables. If B is the matrix which 
represents a linear transformation T, we will refer to the number ||T|| as the 
operator norm of B, and denote it by || B||. (For two other ways to calculate this 
norm, see Exercise 8.2.11.) 

We close this section with an optional result which shows that under the 
identification of linear functions with matrices, function composition is taken 
to matrix multiplication. This, in fact, is why matrix multiplication is defined the 
way it is. 
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*8.18 Remark. /fT : R” > R” and U: R” => R? are linear, then so is UoT. In 
fact, if B is the m x n matrix which represents T, and C is the p x m matrix which 
represents U, then CB is the matrix which represents U o T. 


Proof. Let e1,...,e, be the usual basis of R", uy,..., uy, be the usual basis 
of R”, and w;,..., w, be the usual basis of R?. If B = [bjj|mxn represents T 
and C = [cyx]pxm represents U, then, by Theorem 8.15, 


m 
bk = CG ohbp=H=TE), PHT, 
k=1 


and 
P 
S cokWn = (Cie, ---s Cpk) = U(ug), k= 1,2,...,m. 
v=1 


Hence 


(Uo T)(e;) = UT(e;)) = U (3: bm) = YF bij Ueux) 
k=1 


k=1 
m p m m 
See (> Pea Syn] 
k=1 k=1 


for each 1 < j <n. Since this last vector is the jth column of the matrix CB, 
it follows that CB is the matrix which represents U o T. | 


EXERCISES 
8.2.1. Let a,b,c € R°. 


a) Prove that if a, b, and ec do not all lie on the same line, then an 
equation of the plane through these points is given by (x, y,z)-d = 
a-d, where 


d := (a—b) x (a—c). 


b) Prove that if ¢ does not lie on the line #(t) = ta+b, t € R, then an 
equation of the plane that contains this line and the point ¢ is given 
by (x, y,z):d =b-d, where d :=a x (b—¢). 


8.2.2. a) Find an equation of the hyperplane through the points (1, 0, 0, 0), 
(2, —1, 0, 0), (0, 5, 2, 0), and (0, —8, 0, 3). 
b) Find an equation of the hyperplane that contains the lines ¢ (t) = 
(2r, t,t, 1) and y(t) = (1—5,1-1t,t,1+1),t ER. 
c) Find an equation of the plane parallel to the hyperplane x; +... + 
Xn = Ww passing through the point (1, 2,..., 7). 
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8.2.3. 
8.2.4. 


8.2.5. 


8.2.6. 


8.2.7 |. 


8.2.8. 


Euclidean Spaces 


Find two lines in R? which are not parallel but do not intersect. 
Suppose that T « £(R"; R”) for some n,m EN. 


a) Find the matrix representative of T if T(x, y,z,w) =(x +z,x —y, 
0,-x+y+w). 

b) Find the matrix representative of T if T (x,y,z) =x+y-—z. 

c) Find the matrix representative of T if T (11, x2,...,%n) = (1, 
X1 — Xp). 

Suppose that T € £ (R"; R”) for some n,m EN. 

a) If Td, 1) = (0,4, z) and T(0, 1) = (1, 3, 0), find the matrix repre- 
sentative of T. 

b) If T(1, 1,0) = (v2, e), T(0,—1,1) = (0, —1), and T(1,1,-1) = 
(x, 1), find the matrix representative of T. 

c) If T(, 1, 1,0) = 1,7), TQ, 1, -1,0) = (7, 1), and T (0, 0, 0, —1) 
(v3, 3), find all possible matrix representatives of T. 

d) If TdU,1,0,0) = (,2,7), T(0,0,1,0) = (5,3,1), and T(0,0, 
0, -—1) = 2, z, —1), find all possible matrix representatives of T. 


Suppose that a, b,c € R® are three points which do not lie on the same 
straight line and that /7 is the plane which contains the points a, b,c. 
Prove that an equation of /7 is given by 


X—aA\ y—a2 ZL a3 
det} b} —a, bo—ay b3-—a3| =0. 
C1] —~Q@) C2—aQ C3 — 43 


This exercise is used in Appendix E. Recall that the area of a parallel- 
ogram with base b and altitude h is given by bh, and the volume of a 
parallelepiped is given by the area of its base times its altitude. 


a) Let a, b € R® be nonzero vectors and P represent the parallelogram 
{(x, y,z) =ua+vb:u, ve [0, 1]}. 


Prove that the area of P is |la x bl. 
b) Let a,b,c € R?® be nonzero vectors and P represent the paral- 
lelepiped 


{(x, y,z) =ta+ub+ vce: t,u,v € [0, 1]}. 


Prove that the volume of P is |(a x b) - ¢|. 


The distance from a point xo = (x0, yo, Zo) to a plane /7 in R? is defined 
to be 


0 xo € JT 


dist (xo, [7):= iv ere i 


8.2.9 |. 


8.2.10. 


8.2.11. 
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where Vv := (xo — X1, yo — Y1, 20 — Z1) for some (x1, y1, z1) € HT, and v 
is orthogonal to /7 (i.e., parallel to its normal). Sketch /7 and xo for a 
typical plane /7, and convince yourself that this is the correct definition. 
Prove that this definition does not depend on the choice of v, by showing 
that the distance from x9 = (xo, yo, Zo) to the plane JT described by 
ax + by+cz=dis 


|axo + byo + czo — dl 
Vae+tet+e | 


[ROTATIONS IN R?]. This exercise is used in Section *15.1. Let 


B= cos@ —sin@ 
~ |sin@  cosé 


dist (xo, IT) = 


for some 6 € R. 


a) Prove that ||B(x, y)|| = ||(x, y)|| for all (x, y) € R’. 

b) Let (x, y) € R? be a nonzero vector and ¢ represent the angle 
between B(x, y) and (x, y). Prove that cosg = cos6. Thus, show 
that B rotates R* through an angle 6. (When 6 > 0, we shall call B 
counterclockwise rotation about the origin through the angle 6.) 


For each of the following functions f, find the matrix representative of a 
linear transformation T € £ (R; R”) which satisfies 


i f(x +h) -f@)-TH)| _ 

im — 
h>0 h 

a) f(x) = (x3/3, cos 2x) 

b) f(x) = (kx, Inx, 6 — 5x — x?) 

c) f(x) = (0, 1, x, x”, x7) 


Fix T € £(R"; R”). Set 


M, := sup |/T(x)|| and 
xl]=1 


M2 := inf{C > 0: ||T(x)|| < C|lx|| for all x € R”}. 


0. 


a) Prove that M, < ||T|. 
b) Using the linear property of T, prove that if x 4 0, then 


ITOOI 
Ix 


1- 


c) Prove that Mj = M2 = ||T|. 


CHAPTER 9 


Convergence in R” 


In this chapter we generalize the concepts of limits and continuity from R to R”. 


9.1 TOPOLOGY OF R” 


If you want a more abstract introduction to the topology of Euclidean spaces, skip 
the rest of this chapter and the next, and begin Chapter 10 now. 


Topology, a study of geometric objects which emphasizes how they are put 
together over their exact shape and proportion, is based on the fundamental 
concepts of open and closed sets, a generalization of open and closed intervals. 
In this section we introduce these concepts in R” and identify their most basic 
properties. In the next chapter, we shall explore how they can be used to charac- 
terize limits and continuity without using distance explicitly. This additional step 
in abstraction will yield powerful benefits, as we shall see in Section 9.7 and in 
Chapter 11 when we begin to study the calculus of functions of several variables. 
We begin with a natural generalization of intervals to R”. 


9.1 Definition. 


Leta € R". 


i) For each r > 0, the open ball centered at a of radius r is the set of points 
B,(a) := {x € R”: |x —al] <r}. 


ii) For each r > 0, the closed ball centered at a of radius r is the set of points 


{x € R": ||x —al] <r}. 


Notice that when n = 1, the open ball centered at a of radius r is the open 
interval (a —r,a+r), and the corresponding closed ball is the closed interval 
[a —r,a+r]. Also notice that the open ball (respectively, the closed ball) cen- 
tered at a of radius r contains none of its (respectively, all of its) “boundary” 
{x : |x — al] = r}. Accordingly, we will draw pictures of balls in R* with the 
following conventions: Open balls will be drawn with dashed “boundaries” and 
closed balls will be drawn with solid “boundaries” (see Figure 9.1). 

To generalize the concept of open and closed intervals even further, observe 
that each element of an open interval / lies “inside” J (i.e., is surrounded by 
other points in /). On the other hand, although closed intervals do NOT sat- 
isfy this property, their complements do. Accordingly, we make the following 
definition. 
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FIGURE 9.1 


9.2 Definition. 


Letn EN. 


i) A subset V of R” is said to be open (in R") if and only if for every a € V 
there is an e > O such that B,(a) C V. 

ii) A subset E of R” is said to be closed (in R") if and only if ES := R"\E is 
open. 


The following result shows that every “open” ball is open. (Closed balls are 
also closed—see Exercise 9.1.2.) 


9.3 Remark. For every x € B,(a) there is an € > 0 such that B;(x) C B,(a). 


Proof. Let x € B,(a). Using Figure 9.1 for guidance, we set ¢ = r — 
|x — all. If y € B-(x), then by the Triangle Inequality, assumption, and the 


choice of e, 
lly — all < lly — x] + |x —all < e+ |[x—al] =r. 


Thus, by definition, y € B,(a). In particular, B;(x) C B,(a). a 


(Once again, drawing diagrams in R? led us to a proof valid for all Euclidean 


spaces.) 
Here are more examples of open sets and closed sets. 
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9.4 Remark. [fa € R”, then R” \{a} is open and {a} is closed. 


Proof. By Definition 9.2, it suffices to prove that the complement of every 
singleton E := {a} is open. Let x € E* and set e = ||x—all. Then, by definition, 
a ¢ B,(x), So B,(x) C E°. Therefore, E° is open by Definition 9.2. a 


Students sometimes mistakenly believe that every set is either open or closed. 
Some sets are neither open nor closed (like the interval [0, 1)). And, as the 
following result shows, every Euclidean space contains two special sets which 
are both open and closed. (We shall see below that these are the only subsets of 
R” which are simultaneously open and closed in R”.) 


9.5 Remark. For each n € N, the empty set 6 and the whole space R" are both 
open and closed. 


Proof. Since R"” = @° and % = (R")°, it suffices by Definition 9.2 to prove that 
# and R” are both open. Because the empty set contains no points, “every” 
point x € @ satisfies B,(x) C G. (This is called the vacuous implication.) There- 
fore, @ is open. On the other hand, since B;(x) C R” for all x € R” and all 
€ > 0, it is clear that R” is open. a 


It is important to recognize that open sets and closed sets behave very 
differently with respect to unions and intersections. (In fact, these proper- 
ties are so important that they form the basis of an axiomatic system which 
describes all topological spaces, even those for which measurement of distance is 
impossible.) 

9.6 Theorem. Letn €N. 
i) If {Va}aea is any collection of open subsets of R", then 
U Ve 
acA 


is open. 
ii) [If {Ve :k =1,2,..., p} isa finite collection of open subsets of R", then 


Pp 
() Ya = () VE 
k=1 ke{1,2,...,p} 


is open. 
iti) If {Ew}aea is any collection of closed subsets of R", then 


(Fe 
acd 


is closed. 
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iv) If {Ex :k =1,2,..., p} isa finite collection of closed subsets of R", then 


U Ey:= U Ex 


k=l ke{1,2,....p} 


is closed. 
v) If V is open and E is closed, then V\E is open and E\V is closed. 


Proof. i) Letx € Ue, Vo. Then x € Vy for some a € A. Since V, is open, it 
follows that there is anr > O such that B,(x) C Vy. Thus B(x) © Uge, Va; 
that is, this union is open. 

ii) Letx € (\~_, Ve. Then x € V& fork = 1,2,..., p. Since each Vj is 
open, it follows that there are numbers r; > 0 such that B,,(x) C Vy. Let 
r= min{r},...,rp}. Thenr > 0 and B,(x) C V; for allk = 1,2,..., p; that 
is, B(x) C ae | Vk. Hence, this intersection is open. 

iii) By DeMorgan’s Law (Theorem 1.36) and part i), 


(Me) =Uss 


acA acA 


is open, SO (<4 Eq is closed. 
iv) By DeMorgan’s Law and part ii), 


P : P 
k=1 k=1 
is open, so UP_, Ex is closed. 


v) Since V\E = VN ES and E\V = ENV‘, the former is open by part ii), 
and the latter is closed by part iii). a 


The finiteness hypotheses in Theorem 9.6 are crucial, even for the case n = 1. 


9.7 Remark. Statements ii) and iv) of Theorem 9.6 are false if arbitrary collec- 
tions are used in place of finite collections. 


Proof. In the Euclidean space R, 


is closed and 


is open. a 


306 Chapter 9 Convergence in R” 


To see why open sets are so important to analysis, we reexamine the definition 
of continuity using open sets. By Definition 3.19, a function f : E > R is 
continuous ata € E if and only if given e > O there isa dé > Osuch that |x—a| < 6 
and x € E imply | f(x) — f(a)| < «. Put in “ball language,” this says that f is 
continuous ata € E if and only if f(E MN Bs(a)) C Be(f (a)); that is, EN Bs(a) C 
f'(Be(f(a))). In particular, f is continuous at a ¢€ E if and only if for all 
a € R, the inverse image under f of every open ball centered at f(a) contains 
the intersection of E and an open ball centered at a. 

Intersecting the ball centered at a with E adds a complication. It would be 
simpler if the inverse image of an open ball under a continuous function just 
contained another open ball. Can we discard the set E like that? To answer 
this question, we consider two functions, f(x) = 1/x and g(x) = 1+</x—-1, 
and one open ball, (—1, 3), centered at 1. Notice that f—!(—1, 3) = (—oo, —1) U 
(1/3, co) does contain an open ball centered at a = 1 but g—'(-1,3) = [1,5) 
does not. It is merely the intersection of an open ball and the domain of g: 
g!(—1, 3) = [1, 5) = [1], 00) N(-5, 5). Evidently, we cannot discard the domain 
E of acontinuous function when restating the definition of continuity using open 
balls, unless E is open (see Exercise 9.7.3). 

Accordingly, we modify the definition of open and closed along the follow- 
ing lines. 


9.8 Definition. 
Let E CR". 


i) Aset U C Eis said to be relatively open in E if and only if there is an open 
set A such thatU = ENA. 

ii) Aset C C Eissaid to be relatively closed in E if and only if there is a closed 
set B such thatC = ENB. 


We postpone using these concepts to study continuous functions on R” until 
Chapter 9. Meanwhile, we shall use relatively open sets to introduce connectiv- 
ity, a concept which generalizes to R” an important property of intervals which 
played a role in the proof of the Intermediate Value Theorem, and which will 
be used several times in our development of the calculus of functions of sev- 
eral variables. First, we explore the analogy between relatively open sets and 
open sets. 


9.9 Remark. Let U C E CR". 


i) Then U is relatively open in E if and only if for each a € U there is anr > 0 
such that B.(a)N E CU. 

ii) If Eis open, then U is relatively open in E if and only if U is (plain old vanilla) 
open (in the usual sense). 
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Proof. i) If U is relatively open in E, then U = EA for some open set A. 
Since A is open, there is anr > 0 such that B,(a) C A. Hence, B,(a)N E Cc 
ANE=U. 

Conversely, for each a € U choose an r(a) > 0 such that B,(a)(a) E CU. 
Then U,ey Briay(a) 0 E © U. Since the union is taken over all a € U, the 
reverse set inequality is also true. Thus U,cy Briay(a) N E = U. Since the 
union of these open balls is open by Theorem 9.6, it follows that U is relatively 
open in E. 

ii) Suppose that U is relatively open in E. If E and A are open, then U = 
EMA {is open. Thus U is open in the usual sense. Conversely, if U is open, then 
EMU =U is open. Thus every open subset of E is relatively openin EZ. 


Next, we introduce connectivity. 


9.10 Definition. 
Let E be a subset of R”. 


i) A pair of sets U, V is said to separate E if and only if U and V are nonempty, 
relatively openin EL, E=UUV,andUNV=49%. 

ii) E is said to be connected if and only if E cannot be separated by any pair 
of relatively open sets U,V. 


Loosely speaking, a connected set is all in one piece (i.e., cannot be broken 
into smaller, nonempty, relatively open pieces which do not share any common 
points). 

The empty set is connected, since it can never be written as the union of 
nonempty sets. Every singleton E = {a} is also connected, since if E = 
U UV, where UM V = § and both U and V are nonempty, then E has 
at least two points. More complicated connected sets can be found in the 
exercises. 

Notice that by Definitions 9.8 and 9.10, a set E is not connected if there are 
open sets A, B such that E 1 A, EM B are nonempty, E = (EN A) U(ENB), 
and AN B = @. Is this statement valid if we replace E = (EM A) U (EN B) by 
ECAUB? 


9.11 Remark. Let E C R". If there exists a pair of open sets A,B such that 
ENA#%, ENB#%, ECAUB,and AN B =4Q, then E is not connected. 


Proof. Set U = EN Aand V = EB. By hypothesis and Definition 9.8, 
U and V are relatively open in E and nonempty. Since UNV CAN B= 4M, 
it suffices by Definition 9.10 to prove that E = U UV. But E is a subset of 
AUB,so E CUUYV. On the other hand, both U and V are subsets of E, so 
EDUUV. We conclude that E =UUV. |_| 


(The converse of this result is also true, but harder to prove—see Theorem 
9.20 below.) 


308 Chapter 9 Convergence in R” 


#Y 


FIGURE 9.2 


In practice, Remark 9.11 is often easier to apply than Definition 9.10. Here 
are several examples. The set Q is not connected: set A = (—ov, J/2) and B = 
(/2, 00). The “bow-tie-shaped set” {(x, y): -I1 < x < 1 and —|x| < y < |x}} 
is not connected (see Figure 9.2): set A = {(x, y) : x < O} and B = {(x, y): 
x > Of. 

Is there a simple description of all connected subsets of R? 


9.12 Theorem. A subset E of Ris connected if and only if E is an interval. 


Proof. Suppose that E is a connected subset of R. If E is empty or if E con- 
tains only one point c, then E is one of the intervals (c, c) or [c, c]. 

Suppose that FE contains at least two points. Set a = inf E and b = sup E, 
and observe that —co < a < b< w. Ifa € Esetaq,r = a,andifbeE 
set bk = b, k € N. Otherwise, use the Approximation Property to choose 
ay, by € E such that a, | a and by + bask — co. Notice that in all cases, 
E contains each [ax, bg]. Indeed, if not, say there is an x € [ag, bg]\E, then 
a, € EN (—o,x), by € EN (x, 00), and E C (—oo, x) U (x, oo). Hence, by 
Remark 9.11, E is not connected, a contradiction. Therefore, E D [ax, bx] for 
allk EN. It follows from construction that 


E =| Jlax, bx. 


k=1 


Since this last union is either (a,b), [a,b), (a, b], or [a, b], we conclude that 
E is an interval. 

Conversely, suppose that E is an interval which is not connected. Then 
there are sets U, V, relatively open in E, which separate E (i.e., E = UUV, 
UNV = 9), and there exist points x; ¢ U and x27 € V. We may suppose that 
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x1 < x2. Since x1, x2 € E and E is an interval, Jo := [x1, x2] C E. Define f on 
Ip by 


xEeU 


0 
f@) = 1 xev. 


Since UNV = @, f is well defined. We claim that f is continuous on Jp. 
Indeed, fix x9 € [x1], x2]. Since U UV = E 32D Jo, it is evident that x9 € U 
or x9 € V. We may suppose the former. Let yz € Jo and suppose that y, — xo 
as k — oo. Since U is relatively open, there is an e > 0 such that (xp — €, x9 + 
e)NE CU. Since y, € E and yg > xo, it follows that y, € U for large k. 
Hence f (yz) = 0 = f (xo) for large k. Therefore, f is continuous at xo by the 
Sequential Characterization of Continuity. 

We have proved that f is continuous on Jp. Hence by the Intermediate 
Value Theorem (Theorem 3.29), f must take on the value 1/2 somewhere on 
Ip. This is a contradiction, since by construction, f takes on only the values 0 
or 1. a 


We shall use this result later to prove that a real function is continuous on 
a closed, bounded interval if and only if its graph is closed and connected (see 
Theorem 9.71). 


EXERCISES 


9.1.1. Sketch each of the following sets. Identify which of the following sets 
are open, which are closed, and which are neither. Also discuss the con- 
nectivity of each set. 


a) E={(x,y):x 40} 
b) E ={(x, y): 4x? + 9y? < 1} 
c) ES Gaps 40S yl 
dy Bel Gjyy ix —2y7 = bape 
e) E = {(x,y):x*+4x + y? =O} U{(x, 0): x €[7, 9]} 
9.1.2. Letn EN, leta € R”, lets,r €¢ Rwiths <r, and set 
V ={xeR":s < |lx—all<r} and E={xeR":s < |x—all <r}. 
Prove that V is open and E is closed. 


9.1.3. a) Leta < bandc <d be real numbers. Sketch a graph of the rectangle 
[a, b] x [c,d] := {(x, y): x € [a,b], y € [e, d]}, 


and decide whether this set is connected. Explain your answers. 
b) Sketch a graph of set 


By (—2, 0) U Bi (2, 0) U {(x, 0) : -1 < x < J}, 


and decide whether this set is connected. Explain your answers. 
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9.1.4. 


9.1.5. 


9.1.6. 


9.1.7. 


9.1.8. 


Convergence in R” 


a) Set Ey := {(x,y):x =O} and Ey := {(x, y): 4x74 9y? < 1}, and 
sketch a graph of the set 


U:= [x y) 14x? +99? < J and xzo}. 


b) Decide whether U is relatively open or relatively closed in £). 
Explain your answer. 

c) Decide whether U is relatively open or relatively closed in Ep. 
Explain your answer. 


a) Let FE; denote the closed ball centered at (0, 0) of radius 1 and £2 = 
B js (1, 0), and sketch a graph of the set 


U:= [(, y):x?-+y? sands? —2x+y?-4 < ol. 


b) Decide whether U is relatively open or relatively closed in £). 
Explain your answer. 

c) Decide whether U is relatively open or relatively closed in Ep. 
Explain your answer. 


Suppose that E C R” and that C is a subset of E. 


a) Prove that if E is closed, then C is relatively closed in E£ if and only if 
C is (plain old vanilla) closed (in the usual sense). 

b) Prove that C is relatively closed in E if and only if E\C is relatively 
open in E. 


a) If A and B are connected in R” and AN B # &, prove that A U B is 
connected. 
b) If {Eu}wea is a collection of connected sets in R” and Nyca Eg # Y, 


prove that 
E=|J 


acA 


is connected. 

c) If A and B are connected in R and AN B F Q, prove that AN B is 
connected. 

d) Show that part c) is no longer true if R? replaces R. 


Let V be a subset of R”. 


a) Prove that V is open if and only if there is a collection of open balls 
{By : a € A} such that 


V = |J Ba. 


acA 
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b) What happens to this result when open is replaced by closed? 
9.1.9. Show that if E is closed in R" anda ¢ E, then 


inf ||x —al| > 0. 
xeE 


9.1.10. Graph generic open balls in R? with respect to each of the “non- 
Euclidean” norms || - ||; and || - ||.o. What shape are they? 


9.2 INTERIOR, CLOSURE, AND BOUNDARY 


To prove that every set contains a largest open set and is contained in a smallest 
closed set, we introduce the following topological operations. 


9.13 Definition. 
Let E be a subset of a Euclidean space R”. 


i) The interior of E is the set 
E° =v : VC E and V is open in R"}. 
ii) The closure of E is the set 


E=( |e : BD E and B is closed in R"}. 


Notice that every set E contains the open set # and is contained in the closed 
set R”. Hence, the sets E° and E are well defined. Also notice by Theorem 9.6 
that the interior of a set is always open and the closure of a set is always closed. 

The following result shows that E° is the largest open set contained in F, and 
E is the smallest closed set which contains E. 


9.14 Theorem. Let E C R”. Then 


1) Ao CREF, 
ii) if V is openand V C E, then V C E®, and 
ili) if C is closed and C D E, thenC 2D E. 


Proof. Since every open set V in the union defining E° is a subset of E£, it is 
clear that the union of these V’s is a subset of E. Thus £° C E. A similar 
argument establishes E C E. This proves i). 

By Definition 9.13, if V is an open subset of FE, then V C E°, andif C isa 
closed set containing E, then E C C. This proves ii) and iii). i) 


In particular, the interior of a bounded interval with endpoints a and b is 
(a, b), and its closure is [a,b]. In fact, it is evident by parts ii) and iii) that 
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E = E° if and only if E is open, and E = E if and only if E is closed. We shall 
use this observation many times below. 
Let us examine these concepts in the concrete setting R*. 


9.15 EXAMPLES. 


a) Find the interior and closure of the set E = {(x,y) : -—1 < x < 1 and 
—|x| < y < [x]}. 

b) Find the interior and closure of the set EF = B,(—2,0) U By, (2,0) U {(x, 0) : 
—-l<x<l}. 


Solution. 


a) Graph y = |x| and x = +1, and observe that E is a bow tie-shaped region 
with “solid” vertical edges (see Figure 9.2). Now, by Definition 9.2, any open 
set in R? must contain a disk around each of its points. Since E° is the largest 
open set inside E£, it is clear that 


E° = {(x, yy): -l < x < land — |x| < y < |x}. 
Similarly, 


E = {(x, y): -l<.x < land -— |x| <y < |x]}. 


b) Draw a graph of this region. It turns out to be “dumbbell shaped”: two open 
disks joined by a straight line. Thus E° = B,(—2, 0) U B, (2, 0), and 


E = B,(—2,0) U Bi (2,0) U{(x, 0): -1 <x < 1}. a 


These examples illustrate the fact that the interior of a nice enough set E in 
R’ can be obtained by removing all its “edges,” and the closure of E by adding 
all its “edges.” 

One of the most important results from Chapter 5 is the Fundamental The- 
orem of Calculus. It states that the behavior of a derivative f’ on an interval 
[a, b], as measured by its integral, is determined by the values of f at the end- 
points of [a,b]. What shall we use for “endpoints” of an arbitrary set in R”? 
Notice that the endpoints a,b are the only points which lie near both [a, b] 
and the complement of [a, b]. Using this as a cue, we introduce the following 
concept. 


9.16 Definition. 
Let E C R". The boundary of E is the set 


dE :={x eR": forallr>0, B-(K)NE A@and B,(x)N ES FG}. 


[We will refer to the last two conditions in the definition of dF by saying that 
B,(x) intersects E and E°.] 
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9.17 EXAMPLE. 
Describe the boundary of the set 


E ={(x, y): x7 + y? <9 and (x — 1)(y +2) > 0}. 


Solution. Graph the relations x* + y* = 9 and (x — 1)(y + 2) = 0 to see that 
E is aregion with solid curved edges and dashed straight edges (see Figure 9.3). 
By definition, then, the boundary of E is the union of these curved and straight 
edges (all made solid). Rather than describing dE analytically (which would 
involve solving for the intersection points of the straight lines x = 1, y = —2, 
and the circle x* + y* = 9), it is easier to describe 4 E by using set algebra. 


OE = {(x, y):x* + y* <9 and (x — 1)(y +2) = 0} 
\ {(Qx, y) 1x? + y? < 9 and (x — I(y +2) > 0} a 


It turns out that set algebra can be used to describe the boundary of any set. 


1 a 
x 
a3 
FIGURE 9.3 
9.18 Theorem. Let E C R". Then dE = E\E°. 
Proof. By Definition 9.16, it suffices to show that 

x € E if and only if B,(x)N E £9@ for allr > 0, and (1) 
x ¢ E° if and only if B,(x) N E* £4 M for all r > 0. (2) 


We will provide the details for (9) and leave the proof of (10) as an exercise. 
Suppose that x € E but that B,,(x)N E = @ for some ro > 0. Then (B,,(x))*° 
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is a closed set which contains E; hence, by Theorem 9.14iii, E (By) (x))*. It 
follows that E 9 B,,(x) = & (e.g., x ¢ E), a contradiction. 

Conversely, suppose that x ¢ E. Since (E)° is open, there is an 79 > 0 
such that B,,.(x) € (£)°. In particular, @ = B,(x)N E D> B, (x) N E for 
some ro > 0. i 


We have introduced topological operations (interior, closure, and boundary). 
The following result answers the question, How do these operations interact 


with the set operations (union and intersection)? 


9.19 Theorem. Let A, B C R”. Then 


i) (AUB)? D A°U B2, (ANB)? = APN B®, 


ii) AUB=AUB, ANBCANB, 


iti) (AUB) COAUOB, and da(ANB)COdOAUOB. 


Proof. i) Since the union of two open sets is open, A° U B® is an open subset 
of A U B. Hence, by Theorem 9.14ii, A° U B° C (AU B)?. 

Similarly, (ANB)? > A°N B°. On the other hand, ifV C ANB,thenV CA 
and V Cc B. Thus (AN B)? C A? B®. 

ii) Since A U B is closed and contains A U B, it is clear that, by Theo- 
rem 9.14iii, AU B C AUB. Similarly, AN B C ANB. To prove the reverse 
inequality for union, suppose that x ¢ A U B. Then, by Definition 9.13, there 
is a closed set E which contains A U B such that x ¢ E. Since E contains both 
A and B, it follows that x ¢ A and x ¢ B. This proves part ii). 

iii) Let x € 0(AUB); that is, suppose that B,(x) intersects AUB and (AUB)* 
for allr > 0. Since (A U B)® = ASN B‘, it follows that B,(x) intersects both 
A‘ and B° for all r > 0. Thus B,(x) intersects A and A‘ for all r > 0, or B,(x) 
intersects B and B¢ for all r > 0 (ie., x € 0A UOB). This proves the first set 
inequality in part iii). A similar argument establishes the second inequality in 
part iii). a 


The second inequality in part ili) can be improved (see Exercise 9.2.10d). 

Finally, we note (Exercise 9.2.11) that relatively open sets in E can be divided 
into two kinds: those inside E°, which contain none of their boundary, and those 
which intersect dE, which contain only that part of their boundary which inter- 
sects dE. (See Figures 15.3 and 15.4 for an illustration of both types.) 


We close this section by showing that the converse of Remark 9.12 is also true. 
This result is optional because we do not use it anywhere else. 


*9.20 Theorem. Let E C R". If there exist nonempty, relatively open sets U, V 
which separate E, then there is a pair of open sets A, B such that AN E # @, 
BOE #%, ANB=%,andECAUB. 
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Proof. We first show that 
UNV =8. (3) 


Indeed, since V is relatively open in E, there is a set Q, open in R”, such that 
V=ENQ. Since UNV = Y, it follows that U Cc Q°. This last set is closed in 
R". Therefore, 


UCH=N; 


that is, (11) holds. 
Next, we use (11) to construct the open set B. Set 


bx = inf{\|x—ul]:ueU}, xeV, and B=(|) By/2(x). 


xeV 


Clearly, B is open in R". Since 5, > 0 for each x ¢ U (see Exercise 9.1.9), 
B contains V; hence BM E D V. The reverse inequality also holds, since by 
construction BU = % and by hypothesis E = U UV. Therefore, BN E = V. 
Similarly, we can construct an open set A such that AN E = U by setting 


ey = inf{|v—yll|:veV}, yeU and A=(|) B,,/2(y). 
yeU 


In particular, A and B are nonempty open sets which satisfy E C AUB. 

It remains to prove that AN B = YJ. Suppose, to the contrary, that there is a 
point a ¢ AM B. Thena € B3,/2(x) for some x € V anda ce Bey /2(Y) for some 
y € U. We may suppose that 6, < ¢y. Then 

dx  &y 
el Se all lia yl ss ey 


Therefore, ||x — y|| < inf{||v — y|| : v € V}. Since x € V, this is impossible. We 


conclude that AN B= 9. a 
EXERCISES 
9.2.1. Find the interior, closure, and boundary of each of the following subsets 
of R. 


a) E=({l/n:neZ} 
b) E=Un1 (-2. 4) 


9.2.2. For each of the following sets, sketch E°, E, and dE. 


a) E = {(x, y): 4x7 + 9y? < 1} 
b) E={(x,y):x*+4x + y? =0} U{(@, 0): x €[7, 9} 


316 Chapter 9 


9.2.3 |. 


9.2.4. 


9.2.5. 
9.2.6. 


9.2.7. 


9.2.8. 


9.2.9. 


9.2.10. 


9.2.11. 


Convergence in R” 


LEHI) yl? 0s y= 2) 
d) E = {(x, y):x*-2y? <1,-l<y<1]} 


This exercise is used in Section 12.1. Suppose that A C B C R". 
Prove that 


ACB and A° CB? 


Let E be a subset of R”. 

a) Prove that every subset A C E contains a set B which is the largest 
subset of A that is relatively open in E. 

b) Prove that every subset A C E is contained in a set B which is the 
smallest closed set containing A that is relatively closed in E. 


Complete the proof of Theorem 9.18 by verifying (10). 

Prove that if E C R is connected, then E° is also connected. Show that 
this is false if “R” is replaced by “R?.” 

Suppose that E C R” is connected and that E C A C E. Prove that A is 
connected. 

A set A is called clopen if and only if it is both open and closed. 


a) Prove that every Euclidean space has at least two clopen sets. 

b) Prove that a proper subset E of R” is connected if and only if it con- 
tains exactly two relatively clopen sets. 

c) Prove that every nonempty proper subset of R” has a nonempty 
boundary. 


Show that Theorem 9.19 is best possible in the following sense. 


a) There exist sets A, B in R such that (A U B)? 4 A° U B®. 

b) There exist sets A, B in R such that ANB £ ANB. 

c) There exist sets A, B in R such that a(A U B) 4 dA UOB and a(AN 
B) A d0AUOB. 


Let A and B be subsets of R”. 


a) Show that 0(A N B)N (AS U (AB)*°) C AA. 

b) Show that ifx € 0(ANB) andx ¢ (ANd B)U(BNdA), thenx € dANODB. 

c) Prove that a(AN B) C (ANJB)U(BNAA)U (GAN OB). 

d) Show that even in R, there exist sets A and B such that (AM B) # 
(ANdB)U(BNdIA)U(AAN OB). 


Let E C R" and U be relatively open in E. 


a) IfU C E°, thenUN dU = 9. 
b) IfUNDE £G, thenUNdU =UNIDE. 


*9.3. COMPACT SETS 


This section requires no material from any other enrichment section. 


In this section we give a more complete description of compact sets. Most of the 
results we state are trivial to prove by appealing to the hard part of Heine—Borel 
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Theorem, specifically, that closed and bounded subsets of a Euclidean space are 
compact. Since this powerful result does not hold in some non-Euclidean spaces, 
our proofs will appeal only to the basic definition of compact sets and, hence, 
avoid using the Heine—Borel Theorem. 

We begin by expanding our terminology concerning what we mean by a 
“covering.” 


9.21 Definition. 
Let V = {Va}wea be a collection of subsets of R”, and suppose that E C R”. 
i) Vis said to cover E (or be a covering of E) if and only if 


BS. |) Vex 


acA 


(No assumption about V, being open is made.) 

ii) Let V be a covering of E. V is said to have a finite (respectively, count- 
able) subcovering if and only if there is a finite (respectively, an at most 
countable) subset Ao of A such that {Vo}aeA, covers E. 


Notice that the collections of open intervals 


rey | aad (arse) 
—, ——— and —-, —— 
K+ k+1/ Suen kK / Seen 


are open coverings of the interval (0, 1). The first covering of (0, 1) has no finite 
subcovering, but any member of the second covering covers (0, 1). Thus, an 
open covering of an arbitrary set might not have a finite subcovering. 

Our first general result about compact sets shows that every “space” contains 
compact sets. 


9.22 Remark. The empty set and all finite subsets of R" are compact. 
Proof. These statements follow immediately from Definition 9.27. The empty 
set needs no set to cover it, and any finite set H can be covered by finitely 
many sets, one set for each element in H. | 
Since the empty set and finite sets are also closed, it is natural to ask whether 
there is a relationship between compact sets and closed sets in general. The 
following three results address this question. 


9.23 Remark. A compact set is always closed. 


Proof. This result follows easily from the sequential characterization of 
closed sets (see the second paragraph in the proof of Theorem 9.28). a 
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Since {(k —1,k+1) : k € N} is an open covering of the closed set E := [1, 00), 
the converse of Remark 9.23 is false. The following result shows that this is not 
the case if E is a subset of some compact set. 


9.24 Remark. A closed subset of a compact set is compact. 


Proof. Let E be a closed subset of H, where H is compact, and suppose that 
VY = {Va}vea iS an open covering of E. Now E* = R” \ E is open. Thus 
Y U {E*‘} is an open covering of H. Since H is compact, there is a finite set 
Ao © A such that 


HCESU L) Va 


acAg 


But EM E* = §. Therefore, E is covered by {Va}wedAp- a 


Finally, we show that every open covering of a set in a Euclidean space has a 
countable subcovering. 


9.25 Theorem. [LINDELOF]. 
Letn € N and let E be a subset of R". If {Value is a collection of open sets 
and E C Uses Va; then there is an at most countable subset Ao of A such that 


Es |i). Ve: 


acAg 


Proof. Let T be the collection of open balls with rational radii and rational 
centers (i.e., centers which belong to Q”). This collection is countable. More- 
over, by the proof of the Borel Covering Lemma, JT “approximates” the col- 
lection of open balls in the following sense: Given any open ball B,(x) C R", 
there is a ball B,(a) € T such that x € B,(a) and B,(a) C B,(x). 

To prove the theorem, let x € E. By hypothesis, x ¢ V, for some a@ € A. 
Since Vy is open, there is ar > 0 such that B,(x) C Vy. Since TJ approximates 
open balls, we can choose a ball By € TJ such that x € By C Vy. The collection 
T is countable and, hence, so is the subcollection 


{U,, Uo,...}:={By 2x € E}. 


By the choice of the balls By, for each k € N there is at least one ag € A such 
that U; © Vy,. Hence, by construction, 


ECUR=UUuSU Ma. 


xeE keN keN 


Thus, set Ag := {ax : k € N}. a 
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EXERCISES 


9.3.1. 


9.3.2. 


9.3.3. 


9.3.4. 


9.3.5. 


9.3.6. 


9.3.7. 


Identify which of the following sets are compact and which are not. If E 
is not compact, find the smallest compact set H (if there is one) such that 
E CH. 


a) {1/e&:k EN}U{O, 1} 

b) {(x, y) € R?:1 <x?/a+4 y*/b < 2} for real numbers 0 < a <b 

c) {(x, y) € R* : y =cos(1/x) for some x € (0, 1]} 

d) {(x, y) €R*:|x+y| <1} 

Let A, B be compact subsets of R”. Using only Definition 9.27ii, prove 
that A U B and AN B are compact. 

Suppose that E C R is compact and nonempty. Prove that sup E, inf 
Eee. 

Suppose that {Va}vea is a collection of nonempty open sets in R” which 
satisfies V, 1 Vg = Y for alla ¥ B in A. Prove that A is countable. What 
happens to this result when open is omitted? 

Prove that if V is open in R”, then there are open balls By, Bo,..., 


such that 
Vee) By. 
JEN 
Prove that every open set in R is a countable union of open intervals. 
Letn EN. 


a) A subset E of R” is said to be sequentially compact if and only if ev- 
ery sequence x; in E has a convergent subsequence x;, whose limit 
belongs to E. Prove that every compact set is sequentially compact. 

b) Prove that every sequentially compact set is closed and bounded. 

c) Prove that a set E C R" is sequentially compact if and only if it is 
compact. 


Let H CR”. Prove that H is compact if and only if every cover {Eq}wea 
of H, where the E,’s are relatively open in H, has a finite subcovering. 


9.4 HEINE-BOREL THEOREM 


In this section, we use the theory of sequences developed above to prove the 
Heine-Borel Theorem. It is difficult to overestimate the usefulness of this pow- 
erful result, which allows us to extend local results to global ones in an almost 
effortless manner (e.g., see Example 9.29). 

We begin with the following “covering” lemma. 


9.26 Lemma. [BOREL COVERING LEMMA]. 
Let E be aclosed, bounded subset of R". If ris any function from E into (0, ov), 
then there exist finitely many points y,,..., yn € E such that 


N 


EC U Bry) (Yj): 
j=l 
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STRATEGY: Since r(y) > 0 and y € B,,y)(y) for each y ¢€ E, it is clear that 
EC yz ¢ Bry)(y). By moving the centers a little bit, we might be able to 
assume that the centers are rational; that is, that E C Uycengn Briyy(y). Since 
Q” is countable (see Theorem 1.42i and Remark 1.43), it would follow that there 
exist y; € EQ" andr; := r(y;) such that E C Uja1 B, (yj). Hence, if the 
covering lemma is false, then there exist x, € E such that x, ¢ Li 1 Br (Yj) 
for k = 1,2,.... Since E is closed and bounded, it follows from the Bolzano- 
Weierstrass Theorem and Theorem 9.37 that some subsequence x; converges 
to a point x ¢ E asv — ov. Since E is a subset of the union of balls B;;(y,), 
this x must belong to some B,,, (y jo). Hence by Theorem 9.36, x;, € B; (yj) for 
large v. But this contradicts the fact that ifk > j, then x, ¢ B,;(y;). Here are 
the details. 


Proof. Step 1: Change the centers. Fix yo € E. By Theorem 9.32, choose 
a € Q” and p := p(yo,a) € Q such that |lyo — al] < r(yo)/4 and r(yo)/4 < 
p <r(yo)/2. Since |lyo — all < r(yo)/4 < p, we have yo € B,(a). On the other 
hand, y € B,(a) implies |[yo — y|l < llyo — all + lla—yll < op +p < r(Yo); that 
is, B,(a) Cc Brvyo) (Yo). 

Step 2: Construct the sequence. We just proved that to each yo € E there 
correspond a € Q” and p(yo, a) € Q such that yo € Bovyo,ay(a) C Bryo)(Yo)- 
Since Q and Q” are countable, it follows that there exist a; € Q” and p; € Q 
such that 


CO 
EC , Bp; (aj). 
j=l 
Suppose for a moment that E is not a subset of any of the finite 
unions oem By,(aj),k EN. For each k, choose x, € E\ Wea By,(aj). By 
Theorems 9.34, 9.37, and 9.36 there is a subsequence x;, and an index jo 
such that x, € Boi (aj) for v large. But by construction, if k, > jo, then 


Xk, ¢ We Bp (aj); in particular, x,, cannot belong to Boj, (aj)) for large v. 
This contradiction proves that there is an N € N such that 


N 
EC LI) Bp; (aj)- 
=) 


Step 3: Finish the proof. By Step 1, given j € N there is a point in E, say y;, 
such that Bp, (aj) C Bry; (yj). We conclude by Step 2 that 


N N 
EC LU Bo, (aj) C U Bry j)(¥j)- - 
j=l j=l 


In conjunction with this important result, we introduce the following con- 
cepts. (For a more complete treatment, see Section 9.7.) 
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9.27 Definition. 
Let E be a subset of R”. 


i) An open covering of E is a collection of sets {Vy}ve4 such that each V, is 


open and 
Bet Ve. 


acA 


ii) The set E is said to be compact if and only if every open covering of E has 
a finite subcovering; that is, if and only if given any open covering {Va}aea 
of E, there is a finite subset Ag = {a1,..., ay} of A such that 


N 
EC | J Va,;. 
j=l 


This definition is sufficiently abstract to make students uneasy when first 
introduced to it. And with good reason! It’s not obvious whether a particu- 
lar open covering has a finite subcovering. Is there an easy way to recognize 
when a set is compact? 

For Euclidean spaces, the answer to this question is yes. In fact, we shall 
use the Borel Covering Lemma to establish the following simple but important 
characterization of compact sets. 


9.28 Theorem. [HEINE-BOREL THEOREM]. 
Let E be a subset of R". Then E is compact if and only if E is closed and 
bounded. 


Proof. Suppose that E is compact. Since {B;(0)},en is an open covering of 
R”, hence of E, there is an N € N such that 


N 
Ec U B,(0). 


k=1 


In particular, E is bounded by N. 

To verify that E is closed, suppose not. Then E is nonempty and (by 
Theorem 9.37) there is a convergent sequence x; € E whose limit x does 
not belong to E. For each y € E, set r(y) := ||x — y||/2. Since x does not 
belong to E, r(y) > 0. Thus each B,,y)(y) is open and contains y; that is, 
{B,(y)(y) : y € E} is an open covering of E. Since E is compact, we can choose 
points y; and radiir; := r(y;), for j = 1,2,..., M such that 


M 
EC\|JB,,(yj). 
j=l 
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Setr := min{r;,..., ry}. (This is a finite set of positive numbers, so r is also 
positive.) Since x; > x ask — oo, x; € B,(x) for large k. But x, € B-(K)NE 
implies xz € B; (yj) for some j € N. Therefore, it follows from the choices of 
r; and r, and from the Triangle Inequality, that 


rj = [Xk —yyll = Ik — yyll — Ilxe — xl 
= 2rj — ||xz —x|| > 2rj —r > 2r; —Trj =7j, 


a contradiction. 

Conversely, suppose that E is closed and bounded. Let {Va}ve4 be an open 
covering of FE. Let x € E. Since {Vy}ae, is an open covering of FE, there exists 
an r(x) > O such that B,(x)(x) C Vy. Thus by the Borel Covering Lemma, 
there exist finitely many points x;,..., xy and radii r; := r(x;) such that 


N 
ECS) B(x). 
j=l 


But by construction, for each j there is an index a; € A such that B, (xj) C 

Vu;- We conclude that {Va yey is a finite subcovering of E. | 

It is important to recognize that the Heine—Borel Theorem no longer holds if 
either closed or bounded is dropped from the hypothesis, even when n = 1| and 
E is an interval. Indeed, (0, 1) is bounded but not closed and 


has no finite subcovering. And [1, oo) is closed but not bounded and 


[l,oo) c LJ (1-<.n) 


neN 


has no finite subcovering. 
As promised above, we can use the Heine—Borel Theorem to extend local 
results to global ones. 


9.29 EXAMPLE. 


Suppose that E is a closed, bounded subset of R. If for every x € E there exist a 
nonnegative function f = f, and anumberr = r(x) > 0 such that f is differen- 
tiable on R, f(t) > O fort € (x —r,x+r),and f(t) =0 fort ¢ (x —2r,x + 2r), 
prove that there exist a differentiable function f and an open set V which contains 
E such that f is nonzero and bounded on E and f(x) =0 forx ¢ V. 


Proof. For each x € E, chooser = r, > O and f, => 0 such that f, is dif- 
ferentiable on R, f,(t) > O fort € I.(x) := (x —r,x +r), and f,(t) = 0 for 
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t é J-(x) := (x — 2r, x + 2r). Since {J,(x)},ez covers E, which is compact by 
the Heine—Borel Theorem, there exist finitely many x;’s in E such that 


N 
Ee | G3) 


jal 


for rj; = r(@x;). Set f = ey fc; and V = Wt J, (xj). Then f is differ- 
entiable since it is a finite sum of differentiable functions. Clearly, V contains 
E. V is open since it is a union of open intervals. If x ¢ E, then x € I, (xj) 
for some j, so fr;() > 0. Thus f(x) >O+---+ foe OS 0 for all 


x € E. Moreover, since f,, is continuous on H := (de: Lk — Tk, Xk + 7K], the 
Extreme Value Theorem implies that there are constants M, that | f,,| < Mx 
on A for all k. Thus | f(x)| < M, +---+ My =: M forall x € H D E. Finally, 
ifx ¢ V, then x ¢ J,,(xj;) for all 7. Thus f(x) =0+0+---+0=0. a 


EXERCISES 


9.4.1. Suppose that K is compact in R” and E C K. Prove that E is compact if 
and only if E is closed. 

9.4.2. Suppose that E is a bounded noncompact subset of R” and that f : E > 
(0, co). If there is a g : E — R such that g(x) > f(x) for all x € E, then 
prove that there exist x},..., x, € E such that 


N 


EC|) Bea). 
j=l 


9.4.3. Suppose that E is a compact subset of R. If for every x € E there exist a 
nonnegative function f = f; and anr = r(x) > Osuch that f isC™ on 
R, f@ = 1 fort € (x -—r,x +r), and f(t) = O fort ¢ (x — 2r,x + 2r), 
prove that there exist a differentiable function f, a nonzero constant M, 
and a bounded, open set V which contains FE such that 1 < f(x) < M for 
allx €¢ E and f(x) =Oforx ¢ V. 

9.4.4. Suppose that K is compact in R” and that for every x € K there is an 
r = r(x) >Osuch that B,(x) M K = {x}. Prove that K is a finite set. 

9.4.5. Let E be closed and bounded in R, and suppose that for each x € E there 
is a function f,, nonnegative, nonconstant, increasing, and C® on R, such 
that f,(x) > Oand f/(y) = 0 for y ¢ E. Prove that there exists a nonneg- 
ative, nonconstant, increasing C® function f on R such that f(y) > 0 for 
ally € Eand f’(y) =Ofor all y ¢ E. 

9.4.6. Suppose that f: R” > R” and that a € K, where K is a compact, con- 
nected subset of R”. Suppose further that for each x € K there is a 6, > 0 
such that f(x) = f(y) for all y € Bs, (x). Prove that fis constant on K; that 
is, ifa € K, then f(x) = f(a) forallx eK. 
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9.4.7. Define the distance between two nonempty subsets A and B of R” by 
dist(A, B) := inf{|Ix —y||:x eA and ye B}. 


a) Prove that if A and B are compact sets which satisfy AM B = @, then 
dist(A, B) > 0. 

b) Show that there exist nonempty, closed sets A, B in R* such that 
AN B=9 but dist(A, B) = 0. 


9.4.8. Suppose that E and V are subsets of R with E bounded, V open, and 
E CV. Prove that there is a C™ function f : E — Rsuch that f(x) > 0 
for x ¢ E and f(x) =Oforx ¢ V. 


9.5 LIMITS OF SEQUENCES 


Using the analogy between the Euclidean norm and the absolute value, we can 
define what it means for a sequence in R” to be convergent, bounded, or Cauchy 
in the following way. 


9.30 Definition. 
Let {xx} be a sequence of points in R”. 
i) {x;} is said to converge to some point a € R” (called the limit of x;,) if and 
only if for every « > 0 there is an N € N such that 


k>N_ implies ||x, —all < e. 


Notation: x; > aask > oo or a = limg_y oo X. 

ii) {x} is said to be bounded if and only if there is an M >0 such that ||x;|| <M 
for allk EN. 

iti) {xz} is said to be Cauchy if and only if for every « > 0 there is an N ¢ N 
such that 


k,m>N imply ||xp —Xmll <e. 


The following result shows that to evaluate the limit of a specific sequence in 
R” we need only take the limits of the component sequences. 


9.31 Theorem. Leta = (a, d0,...,dy) € R" and {x;, = Ges ee angie Thee 
be a sequence in R". Then x, — a, ask — o, if and only if for each j € 
{1,2,...,n}, the component sequence a? > ajask > ow. 


Proof. Fix j € {1,...,2}. By Remark 8.7, 


j £ 
x — aj| < [xe —all < Vn max |x? — aryl. 
1<f<n 
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Hence, by the Squeeze Theorem, ied —> aj ask — oo if and only if ||x; — all 
— Oask — oo. Since ||x; —al| > Oif and only if x, > a, as k > oo, the proof 
of the theorem is complete. | 


This result can be used to obtain the following analogue of the Density of 
Rationals (Theorem 1.18). 


9.32 Theorem. Let Q” := {x € R" : xj € Q for j = 1,2,...,n}. For each 
a € R” there is a sequence xx € Q" such that x, > aas k — oo. 


Proof. Let a := (a1,...,dn)) € R". For each 1 < j <_ n, choose by 
Theorem 1.18 sequences pe € Q such that re — a; (in R) ask > o. 


By Theorem 9.31, x, := Gas ies ie. converges to a (in R”) as k > ov. 
Moreover, xx € Q” for each k EN. | 


A set E is said to be separable if and only if there exists an at most countable 
subset Z of E such that to each a € E there corresponds a sequence x, € Z such 
that x, ~ aask —> ov. Since Q” is countable (just iterate Theorem 1.421), it 
follows from Theorem 9.32 that R” is separable. 

Theorem 9.32 illustrates a general principle. As long as we stay away from 
results about monotone sequences (which have no analogue in R” whenn > 1), 
we can extend most of the results found in Chapter 2 from R to R”. Since the 
proofs of these results require little more than replacing |x — y| in the real case 
by ||x — y|| in the vector case, we will summarize what is true and leave most of 
the details to the reader. 


9.33 Theorem. Letn <N. 


i) A sequence in R" can have at most one limit. 
ii) If {xx}cen is a sequence in R" which converges to a and {Xx;}jen is any subse- 
quence of {Xx }ken, then x_, converges to aas j > ©. 
iii) Every convergent sequence in R" is bounded, but not conversely. 
iv) Every convergent sequence in R" is Cauchy. 
v) If {xx} and {y,} are convergent sequences in R" and a € R, then 


lim (x, + yx) = lim xz + lim yx, 
k->co k>0o k-oo 
lim (@x,) =a@ lim xx, 
k>oo k>oo 
and 
lim (xx - yx) = (lim xx) - (lim yg). 
k>0o k>oo k>oo 
Moreover, when n = 3, 


lim (xg x yx) = (lim xx) x (lim yg). 
k>co k>0o k->oo 
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Notice once and for all that (since ||x;||? = x, - xx), the penultimate equation 
above contains the following corollary. If x, converges, then 


lim ||xx|| = || lim xl. 
k->oo k> oo 


As in the real case, the converse of part iv) is also true. In order to prove that, 
we need an n-dimensional version of the Bolzano—Weierstrass Theorem. 


9.34 Theorem. [BOLZANO-—WEIERSTRASS THEOREM FOR R"]. 
Every bounded sequence in R" has a convergent subsequence. 


Proof. Let x; := Gn x ae e be bounded in R”. Then, by Remark 
8.71, the real sequence (x Jean is bounded in R for each j = 1,2,...,7. 

Let j = 1. By the one-dimensional Bolzano—Weierstrass Theorem, there is 
a sequence of integers 1 < k(1, 1) < k(1, 2) <--- anda number qa; such that 
at vy) > 41 aS V > ©. 

Let j = 2. Again, since the sequence Ge yy }veN is bounded in R, there is a 


subsequence {k(2, v)}yen Of {k(1, v)},en and a number a» such that oe > 
az as v + oo. Since {k(2, v)}ven is a subsequence of {k(1, v)},en, we also have 
od — a, aS v — oo. Thus, ae >aasv —> ooforalll <@< j =2. 

Continuing this process until 7 = n, we choose a subsequence k, = k(n, v) 
and points az such that 


lim ye =a 

v> oo v, 
forl < €< j =n. Seta = (a,a2,...,a,). Then, by Theorem 9.31, x;, 
converges to a as v > oo. a 


Since the Bolzano—Weierstrass Theorem holds for R", we can modify proof 
of Theorem 2.29 to establish the following result. 


9.35 Theorem. A sequence {xx} in R" is Cauchy if and only if it converges. 


Thus sequences in R” behave pretty much the same as sequences in R. We 
now turn our attention to something new. How does the limit of sequences 
interact with the topological structure of R”? Answers to this question contain 
a surprising bonus. The ¢«’s begin to disappear from the theory. 


9.36 Theorem. Let x, €¢ R”. Then x, > aas k — o if and only if for every 
open set V which contains a there isan N € N such that k > N implies x, € V. 


Proof. Suppose that x; — a and let V be an open set which contains a. By 
Definition 9.2, there is an ¢ > O such that B,(a) C V. Given this e¢, use 
Definition 9.30 to choose an N € N such that k > N implies x; € B,(a). By 
the choice of e, x, € V forallk > N. 
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Conversely, let ¢ > 0 and set V = B,(a). Then V is an open set which 
contains a; hence, by hypothesis, there is an N € N such that k > N implies 
x, € V. In particular, ||x, — al] < e« forallk > N. | 


This is a first step toward developing a “distance-less” theory of convergence. 
The next result, which we shall use many times, shows that convergent sequences 
characterize closed sets. 


9.37 Theorem. Let E C R". Then E is closed if and only if E contains all its 
limit points; that is, if and only if x, € E and xx, — x imply thatx € E. 


Proof. The theorem is vacuously satisfied if E is the empty set. 

Suppose that E #4 9 is closed but some sequence x, € E converges to a 
point x € E*. Since E is closed, E° is open. Thus, by Theorem 9.36, there is 
an N &€ Nsuch that k > N implies x; € E‘, a contradiction. 

Conversely, suppose that E is a nonempty set which contains all its limit 
points. If E is not closed, then, by Remark 9.5, E # R" and by definition E° 
is nonempty and not open. Thus, there is at least one point x € E° such that 
no ball B,(x) is contained in E°. Let x, € Bi/x(x) NE fork = 1,2,.... Then 
xx € E and ||xz, — x|| < 1/k for all k € N. Now by the Squeeze Theorem, 
Ix, — x|| > 0 (ie., x, > x ask — oo). Thus, by hypothesis, x € EF, a 
contradiction. a 


EXERCISES 


9.5.1. Using Definition 9.301, prove that the following sequences converge. 


1 1 
a) m= (7.7 +1) 


7 3 
b) m= (* 1 cosk ) 


k ’ Ok 
c) xg = (log(k + 1) — logk, e*) 


9.5.2. Using limit theorems, find the limit of each of the following vector 
sequences. 


C. Ae k= 1 
a) x= : 
RN 2 ASE SS 


1 
b) x = (sin Ee 1, cos 24 ) 


1 
c) x = (Gees — Jk? +t} 
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9.5.3. Suppose that x; — 0 in R” as k > oo and that y; is bounded in R". 


a) Prove that x; - yz > Oask > oc. 
b) Ifn = 3, prove that x, x yy > Oask > oo. 


9.5.4. Suppose that a € R”, that x, — a, and that x; —y; — 0, as k — oo. Prove 
that yy > aask > o. 


9.5.5. a) Prove Theorem 9.33: and ii. 
b) Prove Theorem 9.33iii and iv. 
c) Prove Theorem 9.33v. 
d) Prove Theorem 9.35. 


9.5.6. Let E be a nonempty subset of R”. 


a) Show that a sequence x, € E converges to some point a € E if and 
only if for every set U, which is relatively open in E and contains a, 
there is an N € N such that x, e U fork > N. 

b) Prove that a set C C E is relatively closed in E if and only if the limit 
of every sequence xx € C which converges to a point in E satisfies 
limg_so0 KR EC. 


9.5.7. a) A subset E of R” is said to be sequentially compact if and only if every 
sequence x; € E has a convergent subsequence whose limit belongs to 
E. Prove that every closed ball in R” is sequentially compact. 
b) Prove that R” is not sequentially compact. 


9.5.8. a) Let E be a subset of R”. A point a € R” is called a cluster point of E if 
EO B,(a) contains infinitely many points for every r > 0. Prove that 
ais a cluster point of E if and only if for eachr > 0, EM B,(a)\{a} is 
nonempty. 

b) Prove that every bounded infinite subset of R” has at least one clus- 
ter point. 


9.6 LIMITS OF FUNCTIONS 


We now turn our attention to limits of functions. By a vector function (from n 
variables to m variables) we shall mean a function f of the formf : A —> R”, 
where A C R” and m,n are fixed positive integers. Since f(x) € R” for each 
x € A, there are functions f; : A — R (called the coordinate or component 
functions of f) such that f(x) = (fi (x), ..., fin(x)) for each xx € A. Whenm = 1,f 
has only one component and we shall call f real valued. Sometimes, to emphasize 
the fact that a function is real valued (as opposed to vector valued), we will 
denote real functions without boldface (i.e., f : R” > R). 

Iff = (fi,.-., fm) is a vector function where the /;’s have intrinsic domains 
(e.g., the f;’s might be defined by formulas), then the maximal domain of f is 
defined to be the intersection of the domains of the f;’s. The following examples 
illustrate this idea. 
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9.38 EXAMPLES. 


1) 


Find the maximal domain of 


f(x, y) = dog(xy — y + 2x — 2), ,/9 — x? — y?). 


ii) Find the maximal domain of 
g(x, y) = (V1 — x?, log(x” — y), sin x cos y). 

Solution. 

i) This function has two components: fi(x, y) = log(xy — y + 2x — 2) and 
fo(x, y) = V9 — x2 — y?. Since the logarithm is real valued only when its 
argument is positive, the domain of f; is the set of points (x, y) which satisfy 

O<xy-—yt2x-2=(*-104+2). 
Since the square root function is real valued if and only if its argument 
is nonnegative, the domain of f2 is the set of points (x, y) which satisfy 
x? + y? < 9, Thus the maximal domain of f is 
{(x,y):x7+y*<9 and («—1)(y +2) > 0}. 
(This set was shown in Figure 9.3.) 
ii) This function has three component functions: g) (x, y) = V1 — x2, g2(x, y) = 


log(x* — y?), and g3(x, y) = sinx cosy. gj is real valued when 1 — x? > 0; 
that is, —1 < x < 1. go is real valued when x? — y? > 0; that is, when 
—|x| < y < |x|. The domain of g3 is all of R*. Thus the maximal domain 
of g is 


{(x, y): -l <x <1land-— |x| < y < |x|}. 


(This set was shown in Figure 9.2.) a 


To set up notation for the algebra of vector functions, let E C R” and suppose 


that f,g : E — R”. For eachx € E, the scalar product of ana € R with f is 
defined by 


(af)(x) := af(x), 


the sum of f and g is defined by 


(f+ g)(x) := f(x) + g(x), 


the (Euclidean) dot product of f and g is defined by 


(f- g)(x) := f(x) - g(x), 
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and (when m = 3) the cross product of f and g is defined by 
(f x g)(x) := f(x) x g(x). 
(Notice that when m = 1, the dot product of two functions is the pointwise 
product defined in Section 3.1.) 


Here is the multivariable analogue of two-sided limits (compare with 
Definition 3.1). 


9.39 Definition. 


Let n,m € N anda € R’, let V be an open set which contains a, and suppose 
that f: V \ {a} — R”. Then f(x) is said to converge to L, as x approaches a, if 
and only if for every « > 0 there is a 6 > 0 (which in general depends on e, f, 
V, and a) such that 

0 < ||Ix—al| <6 implies ||f(x) —L|| <e. 


In this case we write f(x) — Las x — aor 


L = lim f(x) 
x-a 


and call L the limit of f(x) as x approaches a. 


Using the analogy between the norm on R” and the absolute value on R, we 
can extend much of the theory of limits of functions developed in Chapter 3 to 
the Euclidean space setting. Here is a brief summary of what is true. 


9.40 Theorem. Leta € R”, let V be an open set which contains a, and suppose 
that f, g: V \ {a} > R"™. 
i) If f(x) = g(x) forall x € V \ {a} and if f(x) has a limit as x — a, then g(x) has 
a limit as x — a, and 


lim g(x) = lim f(x). 
x-a xa 
ii) [SEQUENTIAL CHARACTERIZATION OF Limits]. L = limy.af(x) exists if 
and only if f(x.) — Las k — oo for every sequence x, € V \ {a} which 
converges to aas k + ov. 


iii) Suppose that a € R. If f(x) and g(x) have limits, as x approaches a, then so 
do (f+ g)(x), (af)(x), (f- g)(x), and || f (x)||. In fact, 


lim (f + g) (x) = lim f(x) + lim g(x), 
x—-a x-a x~a 

lim (af) (x) = a lim f(x), 

x—-a x—~a 


lim (f-g) (x) = (lim fcx)) - (Jim go) , 
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and 
| lim f(x) | = lim ||f(x)|I. 
x-a x—a 
Moreover, when m = 3, 
lim (f x g)(x) = (Jimfcx)) x (Jimgx)) ; 
x—a xa x—a 
and when m = | and the limit of g is nonzero, 
lim f(x)/g(x) = (Jimtcx)) / (tim g(x)) ' 
xa xa x>a 


iv) [SQUEEZE THEOREM FOR FUNCTIONS]. Suppose that f, g,h: V \ {a} > Rand 
that g(x) < h(x) < f(x) forallx € V \ {a}. If 


lim f(x) = lim g(x) = L, 
x-a x—-a 
then the limit of h also exists, as x — a, and 


lim h(x) = L. 
x—a 


v) Suppose that U is open in R™, that L € U, and thath : U > R? for some 
DEN. JfL = limy_,g g(x) and his continuous at L. Then 


Jim (ho g)(x) = h(L). 


How do we actually compute the limit of a given vector-valued function? 
The following result shows that evaluation of such limits reduces to the real- 
valued case (i.e., the case where the range is one dimensional). Consequently, 
our examples will be almost exclusively real-valued. 


9.41 Theorem. Leta € R”, let V be an open set which contains a, and suppose 
thatf = (fi,..., fm): V \ {a} > R”. Then 


jim f(x) = Li= (Lj, Lo,..., Lm) (4) 
exists in R” if and only if 
lim fj00) = L; (5) 
exists in R for each j = 1,2,...,m. 
Proof. By the Sequential Characterization of Limits, we must show that for 


all sequences x, € V \ {a} which converge to a, f(x.) ~ Lask > wif 
and only if f;(x,) > Lj, ask — oo, for each 1 < j < n. But this last 
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statement is obviously true by Theorem 9.31. Therefore, (1) holds if and only 
if (2) holds. a 


Using Theorem 9.40, it is easy to see that if f; are real functions continuous 
at points a;, for j = 1,2,...,n, then F(x1, x2,...,%n) := fi(v1) + fo(v2) +--+ 
Fn(Xn) and G(x1, X2,..-,Xn) = fi (%1) fo(x2)--- fy(%n) both have limits at the 
point a := (a), a2,..., dy). In fact (see Exercise 9.6.6), 


lim F(x) = F(a) and lim G(x) = Gea). 


This observation is often used in conjunction with Theorem 9.41 to evaluate 
simple limits. 
9.42 EXAMPLES. 
i) Find 
lim  (xy+1,e” +2). 
(x,y) (0,0) 
ii) Prove that the function 


2+x-—y 


FOM= Tare 3y 


has a limit as (x, y) > (0, 0). 


Solution. 


i) By Theorem 9.41, this limit is (0 + 1, e® + 2) = (1, 3). 
ii) The polynomial 2 + x — y (respectively, 1 + 2x? + 3y”) converges to 2 
(respectively, to 1) as (x, y) > (0, 0). Hence, by Theorem 9.40, 
2+x—-y 2 


lim =-=2. a 
(x,y) >(,0) 1+2x2+3y2 1 


The application of Theorem 9.40 in Example 9.42ii is legitimate because the 
limit quotient was not of the form 0/0. For the multidimensional case, |’ H6pital’s 
Rule does not work (see the paragraph following Example 9.44). Hence, prov- 
ing that a limit of the form 0/0 exists in several variables often involves showing 
that the absolute value of the function minus its supposed limit, | f(x) — L], is 
dominated by (i.e., less than or equal to) some nonnegative function g which 
satisfies g(x) > 0 as x — a. Here is a typical example. 


9.43 EXAMPLE. 
Prove that 
3x7y 
f@,y= ea? 


converges as (x, y) > (0, 0). 
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Proof. Since the numerator is a polynomial of degree 3 (see Exercise 9.6.4) 
and the denominator is a polynomial of degree 2, we expect the numerator to 
overpower the denominator; that is, the limit to be 0 as (x, y) > (0,0). To 
prove this, we must estimate f(x, y) near (0,0). Since 2|xy| < x? + y? for all 
(x, y) € R’, itis easy to check that 


3 
If@, Is pel 214 


for all (x, y) € (0,0). Let ¢« > O and set 6 = ¢/2. If 0 < ||(x, y)|] < 6, then 
| f(x, y)| < 2|x| < 2\|(x, y)|| < 26 = e. Thus, by definition, 


lim (x, y) =0. | 
(x,y) (0,0) Py 


It is important to realize that by Definition 9.39, if f converges to L as x — a, 
then || f(x) — L|| is small for all x near a. In particular, f(x) ~ Lasx —> a, 
no matter what path x takes. The next two examples show how to use this 
observation to prove that a limit does not exist. 


9.44 EXAMPLE. 
Prove that the function 


2xy 
fay= Sey, 


has no limit as (x, y) > (0, 0). 


Proof. Let g(x, y) = 2xy and h(x, y) = x7+ y? and suppose that f has a limit 
L, as (x, y) > (0,0). If (x, y) approaches (0,0) along a vertical path (e.g., if 
x =Oand y > 0, y 4 0), then g(0, y) = 0 but A(O, y) 4 0 so L would have 
to be 0. On the other hand, if (x, y) approaches (0,0) along a “diagonal” path 
(e.g., if y = x andx > 0, x £ 0), then g(x, x) = h(x, x) = 2x? so L would 
have to be 1. Since 0 # 1, f has no limit at (0,0). | 


Recall that if f is a function of two variables then f,; denotes the partial 
derivative of f with respect to x and f, denotes the partial derivative of f with 
respect to y. Sometimes students guess that an analogue of |’H6pital’s Rule 
holds for R?; for example, that 


om Oi BD) + By) 
(x,y) > (a,b) h(x, y) (x,y) > (a,b) hy (x, y) + hy(x, y) ; 


Example 9.44 shows that this guess is wrong. Indeed, f = g/h satisfies g, +g) = 
2x + 2y = hy + hy, but the limit of f is NOT (2x + 2y)/(2x + 2y) = 1. It has 
no limit. In particular, be careful about applying one-dimensional results to 
functions of several variables unless the analogue has been proved. 
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In the solution to Example 9.44, the diagonal path was chosen so that the 
denominator of f(x, y) would collapse to a single term. This same strategy is 
used in the next example. 


9.45 EXAMPLE. 


Determine whether 


xy* 


LOSS) Seah 


has a limit as (x, y) > (0, 0). 


Solution. The vertical path x = 0 gives f(0, y) = 0 even before we take the 
limit as y + 0. On the other hand, the parabolic path x = y* gives 


fry) = = 5 #0. 
Therefore, f cannot have a limit as (x, y) > (0, 0). a 
[Notice that if y = mx, then 
2,3 


m-x 


f@,y= 0 


= Ly, 
x2 + m‘4x4 


as x — 0. Thus, Example 9.45 shows that the two-dimensional limit of a function 
might not exist even when its limit along every linear path exists and gives the 
same value. | 

When asked whether the limit of a function f(x) exists, it is natural to begin 
by taking the limit as each variable moves independently. Comparing Exam- 
ples 9.42 and 9.44, we see that this strategy works for some functions but not all. 
To look at this problem more closely, we introduce the following terminology. 
Let V be an open set in R’, let (a,b) € V, and suppose that f: V \{(a, b)} > R”. 
The iterated limits of f at (a, b) are defined to be 


lim lim f(x, y):= lim (sim te, ») and lim lim f(x, y):= lim (Jim tex, y)), 
xa yb yobx->a yb xa 


xa yb 


when they exist. 

The iterated limits of a given function might not exist. Even when they do, 
we cannot be sure that the corresponding two-dimensional limit exists. Indeed, 
although the iterated limits of the function f in Example 9.44 above exist and 
are both zero at (0,0), f has no limit as (x, y) > (0, 0). 

It is even possible for both iterated limits to exist but give different values. 
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9.46 EXAMPLE. 
Evaluate the iterated limits of 
2 
x 
fay= pay 
at (0,0). 


Solution. For each x £0, x?/(x* + y*) > Las y > 0. Therefore, 


et ee x yg 
lim lim = lm —=1. 
x0 y>0 x2 + y2 x30 x2 
On the other hand, 
2 
0 
lim lim — =i). 


= 1m +> 
y>0x 0 x24 y? y>0 y? 
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This leads us to ask, When are the iterated limits equal? The following result 
shows that if f has a limit as (x, y) > (a, b) and both iterated limits exist, then 


these limits must be equal. 


9.47 Remark. Suppose that I and J are open intervals, thata € I and b € J, and 


that f : I x J) \ {(a, b)} — R. If 


g(x) := lim f(x, y) 
yb 


exists for each x € I \ {a}, if limy+a f(x, y) exists for each y € J \ {b}, and if 


f(x, y) > Las (x, y) = (a,b) (in R’), then 


L= lim lim f(x, y) = lim lim f(z, y). 
yobx->a 


Xd y>b 
Proof. Let ¢ > 0. By hypothesis, choose 6 > 0 such that 


0 < ||, y)—(a,b)|| <6 implies |f(x,y)—L| <e. 


Suppose that x € J and that 0 < |x —a| < 6//2. Then for any y which satisfies 


0 < |y —b| < 5/V2, we have 0 < ||(x, y) — (a, b)|| < 5; hence 


Ig(x) — L| = lg@) — F@, wI+1F@, y) — LI < lg@) — f@. yl +e. 


Taking the limit of this inequality as y —> b, we find that |g(x) — L| < e for all 
x € I which satisfy 0 < |x —a| < 5/2. It follows that g(x) > Lasx > a; 


that is, 


L= lim lim f(x, y). 
xa y>b 
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A similar argument proves that the other iterated limit also exists and 
equals L. a 


Notice by Example 9.46 that the conclusion of Remark 9.47 might not hold if 
the hypothesis “f(x, y) > L as (x, y) > (a, b)” is omitted. In particular, if the 
limit of a function does not exist, we must be careful about changing the order 
of an iterated limit. 


EXERCISES 


9.6.1. For each of the following functions, find the maximal domain of f, prove 
that the limit of f exists as (x, y) > (a, b), and find the value of that limit. 
(Note: You can prove that the limit exists without using e’s and 6’s — see 
Example 9.42.) 


a) f(x, y) = (x= 1). =a.-0 
y 


= sin x 


b) f(x, y) = ,x* +2cosx + y’, tan — *), (a,b) = (0, 1) 
y 


Vvixy| 
te , x2 + y? 


ee b) = (0, 0) 


Cee | (xy)? + Qty? aeaye Ged 1)? 


d) f 
iG) = y2+2)>  x2+y4+2x —2y2+2 


fe) 
— 
_= 
& 
= 
II 
ue a. a 
%S 
+ 


), (a,b) = (-1,1) 


9.6.2. 


9.6.3. 


9.6.4. 


9.6.5. 


9.6.6. 


9.6.7. 


9.6.8. 
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Compute the iterated limits at (0,0) of each of the following functions. 
Determine which of these functions has a limit as (x, y) > (0,0) in R?, 
and prove that the limit exists. 
1 —cosx cos y 
a) f(x,y) = aay 
e+y3 
b = =~ 
) f@, y) x3 4+ 3y3 
2x + y 1 
c) f@, y) = (4x2 4 y2"* < 2 
Prove that each of the following functions has a limit as (x, y) > (0, 0) 
8x3 — 27y3 
a) fay) = “Agee Oye” (x, y) € (0,0) 
|x|*y? 
b) SQ, y) = x 4 y2’ (x, y) x (0, 0), 


where a is ANY positive number. 
A polynomial on R" of degree N is a function of the form 


Nj Nn 


ji ji 
P(x1,%2,...,Xn) = ) see ) aji,..., inX] sean ys 


A=90 in=9 


a jn are scalars, Nj,..., Nn are nonnegative integers, and N = 
Ni + No+---+ Nn. Prove that if P is a polynomial on R” anda € R”, 
then lim, ,, P(x) = P(a). 

Suppose that a € R”, that L € R”, and that f: R” — R”. Prove that if 
f(x) — Lasx — a, then there is an open set V containing a and a constant 
M > Osuch that ||f(x)|| < M for allx € V. 

Suppose that a = (a,...,dn) € R”, that f; :R — Rfor j = 1,2,...,n, 
and that g(x1, x2,...,%n) := fi(x1) +++ frm). 


a) Prove that if fj(t) > fj(aj) ast — aj, for each j = 1,...,n, then 
g(x) > filai)-++ fran) as x > a. 

b) Show that the limit of g might not exist if, even for one /j, the hypoth- 
esis “fj (t) > fj(aj)” is replaced by “ f;(t) > L;” for some L; € R. 


Suppose that g : R > Ris differentiable and that g’(x) > 1 for allx € R. 
Prove that if g(2) = 0 and f(x, y) = (x — 2)?(y? — 1)/y> g(x), then there 
is an L € Rsuch that f(x, y) > L as (x, y) > (2, d) for all b € R\{0}. 


a) Prove Theorem 9.40i. 

b) Prove Theorem 9.40i1. 
c) Prove Theorem 9.40iii. 
d) Prove Theorem 9.40iv. 
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9.7. CONTINUOUS FUNCTIONS 


In this section we define what it means for a vector function to be continuous, 
obtain analogues of many results in Sections 3.3 and 3.4, and examine how open 
sets, closed sets, and connected sets behave under images and inverse images by 


continuous functions. We shall use these results many times in the subsequent 
chapters. 


9.48 Definition. 
Let E be a nonempty subset of R” andf: E > R”. 


i) fis said to be continuous ata € E if and only if for every ¢ > 0 there is a 
6 > 0 (which in general depends on «, f, E, and a) such that 


|x —al| <6 and xeE imply |lf(x) —f(a)|| <.«. (6) 


ii) fis said to be continuous on E (notation: f : E > R” is continuous) if and 
only if f is continuous at every x € E. 


Suppose that E is a nonempty subset of R”. It is easy to verify that f is con- 
tinuous at a ¢€ E if and only if f(x,) — f(a) for all x, € E which converge to a. 
Hence, by Theorem 9.33, if f and g are continuous at a point a € E (respectively, 
continuous on £), then so aref+g, af (fora € R),f-g, ||fl], and (when m = 3) 
f x g. Moreover, iff: E — R"” is continuous ata € E andg: f(E) > R? is 
continuous at f(a) € f(£), then g of is continuous ata eé E. 

We shall frequently need a stronger version of continuity. 


9.49 Definition. 
Let E be a nonempty subset of R” andf : E — R”. Then f is said to be 


uniformly continuous on E (notation: f : E > R"” is uniformly continuous) if 
and only if for every ¢ > 0 there is ad > O such that 


|Ix—al|<6 and x,aeE imply |lf(x) —f(a)|| <.«. 


As in the real case, continuity and uniform continuity of a vector function are 
equivalent on closed, bounded sets. By the powerful Heine—Borel Theorem, 
we need only verify this result for compact sets. The definition of compact sets 
allows us to construct a direct proof (compare with the proof of Theorem 3.39). 


9.50 Theorem. Let E be anonempty compact subset of R". If fis continuous on 
E, then f is uniformly continuous on E. 
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Proof. Suppose that f is continuous on E. Given ¢ > 0 anda é€ E, choose 
5(a) > 0 such that 


x €Byq(a) and x¢E imply |if(~ —f(a)|| < - 


Since 4(a)/2 is positive for all a € E, the collection {B5a)/2}acz is an open 
covering of E. By the definition of compact sets, there exist finitely many 
points a; € E and numbers 4; := 6(a;)/2 such that 


N 
EC |) Bi, (aj). (7) 


j=l 


Set 6 := min{d,,..., dy}. Clearly, 6 > 0. 

Suppose that x,a ¢ E with ||x — al] < 6. By (4), x belongs to B,(a;) for 
some | < j < N. Hence, ||a—a,|| < ||a—x||+||x—a;|| < 6; +46; = 26; = d(aj); 
that is, a also belongs to Bsa; (aj). It follows, therefore, from the choice of 
6(a;) that 


\If(x) — f(a)|| < |If€Cx) — f(a;)|| + [|fa;) —f@)|| < 5 al > =e. 


This proves that fis uniformly continuous on E. | 


Thus continuous vector functions behave much the same as continuous real 
functions. 

When we turn our attention to how continuous functions interact with the 
topological structure of R", we again find a surprising bonus. The e’s and 6’s 
disappear. 


9.51 Theorem. Suppose that E C R" and thatf: E > R”. Then f is continuous 
on E if and only if ~'(V) is relatively open in E for every V open in R™. 


Proof. Suppose that f is continuous on E and that V is open in R”. Since # 
is open, we may suppose that some a € f~'(V). By Remark 9.9, to show that 
f—!(V) is relatively open in E we need to find a 5 > 0 such that Bs(a)N E C 
f—'(V). But f(a) € V and V is open, so there is ae > O such that B,(f(a)) C V. 
Since f is continuous at a € E, choose 5 > 0 such that ||x — al| < dandxe E 
implies ||f(x) — f(a)|| < ©; that is, x € Bs(a) 9 E implies f(x) € B,(f(a)). It 
follows that f(Bs(a) M E) C B,(f(a)) C V; that is, Bs(a) N EC f 7'(V). 
Conversely, ifa € E and e > 0, then B,(f(a)) is open in R”. By hypothesis 
f—'(B,(f(a))) is relatively open in E; that is, by Remark 9.9, there is a 6 > 0 
such that B;(a)N E c f~!(B,(f(a))). We conclude that if ||x — al] < 5 and 
x € E, then ||f(x) — f(a)|| < ¢ (Le., that fis continuous ata € £). | 


By Theorem 9.51, when fis continuous on E, f~! takes open sets to relatively 
open sets in FE. If the domain E is open (e.g., if the domain is R”), then the 
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word relatively can be dropped (see Exercise 9.7.3). We shall refer to this prop- 
erty by saying that open sets are invariant under inverse images by continuous 
functions. 

Analogues of these results also hold for closed sets (see Exercises 9.7.5a 
and 9.7.4). In particular, if f is continuous and its domain is a closed set, then 
f—! takes closed sets to closed sets. Thus closed sets are invariant under inverse 
images by continuous functions. 

It is natural to ask whether bounded sets or connected sets are invariant under 
inverse images by continuous functions. The following examples show that the 
answers to these questions are no, even when the range and domain are one 
dimensional. 


9.52 EXAMPLES. 


i) If f(x) = 1/(@@? + 1) and E = (0, 1], then f is continuous on R and E is 
bounded, but f ~!(E) = (—oo, 00) is not bounded. 

ii) If f(x) = x* and E = (1,4), then f is continuous on R and E is connected, 
but f ~!(£) = (—2, —1) U (1, 2) is not connected. 


We now turn our attention from inverse images of sets to images of sets. Are 
open sets and closed sets invariant under images by continuous functions? The 
following examples show that the answers to these questions are also no. 


9.53 EXAMPLES. 


i) If f@me= x* and V = (—1, 1), then f is continuous on V and V is open, but 
f(V) = [0, 1) is neither open nor closed. 

ii) If f(x) = 1/x and E = [1, ov), then f is continuous on E and E is closed, 
but f(£) = (0, 1] is neither open nor closed. 


As the next result shows, however, if a set is both closed and bounded (i.e., 
compact), then so is its image under any continuous function. This innocent- 
looking result has far-reaching consequences which we shall exploit on many 
occasions. 


9.54 Theorem. If H is compact in R" and f: H — R" is continuous on H, then 
f(H) is compact in R™. 


Proof. Suppose that {Vy}vc4 is an open covering of f(H). Then, by Theorem 
1.37, parts iii and v, {f-'(Va)}vea covers H. But by Theorem 9.51, £-! (Vy) 
are relatively open in H; that is, there exist open sets Og such that f (Vy) = 
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04H. Since {Og}aea iS an open covering of H and H is compact, there exist 
a; € Asuch that H C Wh Ou; We conclude by Theorem 1.37, parts i and 
v, that 


N N N 
cH) St] 04,08] =Ut(t a) =U Va); 
j=l j=l j=l 
that is, f(H) is compact by definition. | 


Connected sets are also invariant under images by continuous functions. 


9.55 Theorem. I[f E is connected in R" andf: E — R" is continuous on E, then 
f(£) is connected in R™. 


Proof. Suppose that f(£) is not connected. By Definition 9.10, there exist 
a pair of relatively open sets U, V in f(E) which separates f(£); that is, UN 
f(E) £9, VOF(E) 4%, f(E) = UUV, and UNV = 9. Set A := f7!(U) 
and B := f~'(V). By Exercise 9.7.5b, A and B are relatively open in E. Since 
f(E) = UUV and both f~!(U) and f~!(V) are subsets of E, we also have (see 
Theorem 1.37iii) 


E=f~'!(u)ut (Vv) =AUB. 


Finally, UN V = @ implies f—'!(U) Nf7!(V) = @ (ie., AN B = #). Thus A, B 
is a pair of relatively open sets which separates E (i.e., E is not connected, a 
contradiction). | 


Keeping track of which kind of sets are invariant under images and inverse 
images by continuous functions is a powerful tool. To illustrate this fact, we 
offer the following four results. 


9.56 Remark. The graph y = f (x) of a continuous real function f on an interval 
[a, b] is compact and connected. 


Proof. The function F(x) = (x, f(x)) is continuous from [a, b] into R?, and 
the graph of y = f(x) for x € [a, b] is the image of [a, b] under F. Hence the 
graph of f is compact and connected by Theorems 9.54 and 9.55. a 


It is interesting to note that this property actually characterizes continuity of 
real functions (see Theorem 9.71). 

To appreciate the perspective that the topological point of view gives, com- 
pare the following simple proof with that of its one-dimensional analogue 
(Theorem 3.26). 
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9.57 Theorem. [EXTREME VALUE THEOREM]. 
Suppose that H is a nonempty subset of R" and that f : H > R. If His 
compact, and f is continuous on H, then 


M:=sup{f(x):x¢€H} and m := inf{f(x):x€ A} 


are finite real numbers. Moreover, there exist points xy, Xm € H such that 
M = f (xm) and m = f (Xm). 


Proof. By symmetry, it suffices to prove the result for M. Since H is com- 
pact, f(H) is compact by Theorem 9.54. Thus f(H) is closed and bounded 
by the Heine-Borel Theorem. Since f(H) is bounded, M is finite. By the 
Approximation Property, choose x, € H such that f(x.) ~ Mask > o. 
Since f(H) is closed, M € f(H). Therefore, there is an xy € A such that 
M = f (xm). a 


(For a multidimensional analogue of Theorem 3.29, see Exercise 9.7.9). 
The following analogue of Theorem 4.32 will be used in Chapter 13 to exam- 
ine change of parametrizations of curves and surfaces. 


9.58 Theorem. Jf H is a compact subset of R" andf : H — R" is 1-1 and 
continuous, then f~! is continuous on f(H). 


Proof. By Theorem 9.54 and the Heine—Borel Theorem, f(H) is closed. Thus, 
by Exercise 9.7.4, it suffices to show that (f —!)—! takes closed sets to closed 


sets. To this end, let E be closed in R”. Since the domain of f~! is f(H), we 
have by definition that 


(f~')-'(£) = {x € f(H):f7'(x) =y forsomey€ E}. 
Since f is 1-1, f-'(x) = ye E implies that x € f(£). Thus (f-!)-'(£) = 
f(EN HA). But ENG is closed (see Theorem 9.6) and bounded (by “the bound” 
of H), so by Theorem 9.54 and the Heine—Borel Theorem, f(E M H) is closed 
and bounded. In particular, (f-!)—!(£) = f(E 2 H) is closed. a 


The final result of this section shows that “rectangles” are connected in R”. 
9.59 Remark. [fa; <b; for j =1,2,...,n, then 
R= {(x1,...,%n) 1a; < xj < Dj} 
is connected. 
Proof. Suppose not. Choose nonempty sets U and V, relatively open in R, 


such that R= UUV and UNV =9. Leta € U andbe V, and consider the 
line segment E := {ta+ (1 —1f)b: t € [0, 1]}. Since E is a continuous image 
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of the interval [0, 1], we have by Theorems 9.12 and 9.55 that E is connected. 
On the other hand, since E C R by the definition of R, it is easy to check that 
Up := UNE and Vo := VO E are nonempty sets which are relatively open in 
E and satisfy E = Ug U Vp and Up N Vo = &. It follows that F is not connected, 


a contradiction. | 
EXERCISES 
9.7.1. Define f and g on R by f(x) = cosx and g(x) = |x|/x if x 4 0 and 


9.7.2. 


9.7.4. 


9.7.5. 


9.7.6. 


9.7.3 |. 


*9.7.7 | 


2(0) =0. 


a) Find f(£) and g(£) for E = (0, 27), E = [0, 27], E = (0, 1), and 
E = [0, 1]. Compare your answers with what Theorems 9.51, 9.54, 
and 9.55 predict. Explain any differences you notice. 

b) Find f—!(E£) and g~!(E) for E = (0,1), E = [0, 1], E = (-1, )), 
and EF = [—1, 1]. Compare your answers with what Theorems 9.51, 
9.54, and 9.55 predict. Explain any differences you notice. 


Define f on [0, co) and g on R by f(x) = x and g(x) = 1/x ifx #0 
and g(0) = 0. 


a) Find f(£) and g(£) for E = (0,1), E = [0,1), and E = [0,1]. 
Compare your answers with what Theorems 9.51, 9.54, and 9.55 
predict. Explain any differences you notice. 

b) Find f~!(E£) and g~!(£) for E = (—1, 1) and E = [-1, 1]. Com- 
pare your answers with what Theorems 9.51, 9.54, and 9.55 predict. 
Explain any differences you notice. 


This exercise is used in this section and in Chapter 11. Suppose that A 
is open in R” andf: A > R”. Prove that f is continuous on A if and 
only if f—'!(V) is open in R” for every open subset V of R”. 

Suppose that A is closed in R” andf: A > R”. Prove that f is contin- 
uous on A if and only if f—!(£) is closed in R” for every closed subset 
E of R”. 

Suppose that E C R” and thatf: E > R”. 


a) Prove that f is continuous on E if and only if f~!(B) is relatively 
closed in EF for every closed subset B of R”. 

b) Suppose that f is continuous on E. Prove that if V is relatively 
open in f(£), then f—!(V) is relatively open in E, and if B is rela- 
tively closed in f(£), then f —!(B) is relatively closed in E. 


Prove that 


—1/|x—y| 
fe y=4" ea 
0 x=y 


is continuous on R?. 
This exercise is used in Section *9.8. Let H be a nonempty, closed, 
bounded subset of R”. 
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9.7.8. 


9.7.9. 


*9,7.10 |. 


a) Suppose that f: H — R” is continuous. Prove that 


I|f|| 7:= sup ||£Cx)|| 
xeH 


is finite and there exists an xo € H such that ||f(xo)|| = |IfIlz. 

b) Asequence of functions f, : H — R’” is said to converge uniformly 
on H to a function f : H — R” if and only if for every e > 0 there 
isan N &€ N such that 


k>N and xe€AH imply _ |lf(x) —f(x)|| <e. 


Show that ||f; —f||7 — Oask — ooif and only if f; — f uniformly 
on H ask > oo. 

Prove that a sequence of functions f; converges uniformly on AH if 
and only if for every ¢ > 0 there is an N € N such that 


Cc 


wa 


k,j =>N implies ||f —fjlla <e. 


Let E Cc R" and suppose that D is dense in E (i.e., that D C E and 
D = E). Iff: D — R"” is uniformly continuous on D, prove that f has 
a continuous extension to E; that is, prove that there is a continuous 
function g : E — R” such that g(x) = f(x) for all x € D. 

[INTERMEDIATE VALUE THEOREM]. Let E be a connected subset of R”. 
If f : E > Riscontinuous, f(a) 4 f(b) for some a, b € E, and yisa 
number which lies between f(a) and f(b), then prove that there is an 
x € E such that f(x) = y. (You may use Theorem 9.12.) 


This exercise is used to prove *Corollary 11.35. 


a) A set E C R" is said to be polygonally connected if and only if 
any two points a,b € E can be connected by a polygonal path 
in E; that is, there exist points x, € E, k = 1,...,N, such that 
Xp = a, Xy = band L(xy_1; xx) C E fork =1,..., N. Prove that 
every polygonally connected set in R” is connected. 

b) Let E C R"” be open and xp € E. Let U be the set of points 

x € E which can be polygonally connected in E to xo. Prove that 

U is open. 

Prove that every open connected set in R” is polygonally con- 

nected. 


Cc 


wa 


*9.8 APPLICATIONS 


This section uses no material from any prior enrichment sections. 


We have seen that topological concepts (e.g., closed sets, open sets, and con- 
nected sets) are powerful theoretical tools. In this section we continue de- 
veloping this theme by obtaining three independent theorems (i.e., you may 
cover them in any order) which further elucidate results we obtained in earlier 


chapters. 
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Our first application of topological ideas is a partial converse of Theorem 7.10. 
A sequence of real valued functions { f,} is said to be pointwise increasing (re- 
spectively, pointwise decreasing) on a subset E of R" if and only if f,(x) < 
fix) [respectively, f(x) > fe4i(x)] for allx ¢ E andk e N. A sequence 
is said to be pointwise monotone on E if and only if it is pointwise increasing on 
E or pointwise decreasing on E. 


9.60 Theorem. [DINI]. 

Suppose that H is a compact subset of R" and that f,.: H — R is a pointwise 
monotone sequence of continuous functions. If f, — jf pointwise on H as 
k — coand if fis continuous on H, then f, — f uniformly on H. In particular, 
if by is a pointwise monotone sequence of functions continuous on an interval 
[a, b] which converges pointwise to a continuous function, then 


b b 
jim, [ x(t) a= | (tim 6x(0) dt. 


Proof. By Theorem 7.10, we need only show that f, — f uniformly on H. 
We may suppose that /; is pointwise increasing and that H # 9. 
Let e > 0. For each x € H, choose N(x) € N such that 


k= N(x) implies | fi(x) — f00| < 5. 
Since f and f(x) are continuous on H, choose an r = r(x) > 0 such that 


y €HOB,(x) implies | f(x) — fly)| < ; and | fya@® — fr (yl < > 


By the Heine-Borel Theorem, choose x; € H andr; = r(x;) such that 


M 
HC | J B,, (xj). 
j=l 


Set N = max{N(xj),..., N(xy)}, let x € H, and suppose that k > N. Since 
x € B, (xj) for some j € {1,..., WM} andk > N(x;), it follows that 


If) — fk] = FX) — fe) < fF) — fvexjyy 
<If@®) — faPl+ IFA) — fr~y&/)| 
+ | fx) xp) — frx;) 00! 


Bise aro e ao ts 
3 3 3° 


Since this inequality holds for all x € H, we conclude that f, — f uniformly 
on H ask > o. 
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Our next application of topological ideas is a characterization of Riemann 
integrability of a function f by the size of the set of points of discontinuity of f. 
To measure the size of this set, we make the following definition. (Recall that 
|I| denotes the length of an interval /.) 


9.61 Definition. 


i) Aset E Cc Ris said to be of measure zero if and only if for every ¢ > 0 there 
is a countable collection of intervals {Jj} jen which covers E such that 


ii) A function f : [a,b] — R is said to be almost everywhere continuous on 
[a, b] if and only if the set of points x € [a, b] where f is discontinuous is a 
set of measure zero. 


Notice that, by definition, if E is of measure zero, then every subset of E is 
also of measure zero. Loosely speaking, a set is of measure zero if it is so sparse 
that it can be covered by a sequence of intervals whose total length is as small 
as we wish. 

It is easy to see that a single point E = {x} is a set of measure zero. Indeed, 
I := (« — e/2,x + 6/2), Ik := 0, fork > 2, cover E and have total length e. 
Modifying this technique, we can show that any finite set is a set of measure zero 
(see also Remark 9.62 below). On the other hand, by the Heine—Borel Theorem, 
any open covering of [a, b] has a finite subcovering; hence, any covering of [a, b] 
by open intervals must have total length greater than or equal to b — a. In 
particular, a nondegenerate interval cannot be of measure zero. 

The following result shows that if a set is small in the set theoretical sense, 
then it is small in the measure theoretical sense. 


9.62 Remark. Every at most countable set of real numbers is a set of measure 
zero. 


Proof. We may suppose that E is countable, say E = {x;, x2,...}. Givene > 0 
and j €N, set 
Tj = (ej —e2-F 1 xj ter), 
Then x; € J; and |Jj| = e2-/ for j € N. Therefore, E C UF Fj and 


[e,2) Co 


Viglsey ae z 
J 


j=l j=1 
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The converse of Remark 9.62 is false; that is, there exist uncountable sets of 
measure zero (see Exercise 9.8.9 below). 

The following result shows that the countable union of sets of measure zero 
is a set of measure zero. 


9.63 Remark. [f E,, Eo, ... is a sequence of sets of measure zero, then 


is also a set of measure Zero. 


Proof. Let « > 0. By hypothesis, given k € N we can choose a collection of 
intervals {1} jen Which covers E; such that 


Then the collection ele jen is countable, covers E, and 
(oe) CO CO € 
(k) = 
DD MS oa 
k=1 j=l k=1 
Consequently, E is of measure zero. a 


To facilitate our discussion of points of discontinuity, we introduce the follow- 
ing concepts. 


9.64 Definition. 
Let [a, b] be a closed interval and f : [a,b] ~ R be bounded. 


i) The oscillation of f on an interval J which intersects [a, b] is defined to be 


QrJ):= sup (f(x) — fy). 


x,yeJN[a,b] 
ii) The oscillation of f at a point t € [a, b] is defined to be 


op(O:= lim Qp(@— ht +h), 


when this limit exists. 
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9.65 Remark. Jf f : [a,b] — R is bounded, then ws (t) exists for all t € [a, b] 
and satisfies 0 < wf (t) < 00. 


Proof. Fix t € [a, b] and for each interval J, set 


M;= sup f(x), my= inf f(a). 
ee eae 


Since sup(— f (x)) = — inf f(x), it is obvious that 
Q (J) = Mj —my = 0. (8) 


Suppose for simplicity that t € (a, b), and choose hp so small that (¢ — ho, 
t + ho) C (a,b). For each 0 < h < hg, set 


F(h) = Q(t —h, t +h). 


By the Monotone Property of Suprema, F(A) is increasing on (0, ho) and, 
hence, has a finite limit as h > 0+. By (5), F(h) = 0. Therefore, w(t) exists 
and is both finite and nonnegative. a 


The next result shows that, by using the oscillation function wy, we can rep- 
resent the set of points of discontinuity of any bounded f as a countable union. 


9.66 Remark. Let f : [a,b] — R be bounded. If E represents the set of points 
of discontinuity of f in (a, b], then 


B= {! € [a,b]: wf (t) = -}. 
j=l 


Proof. By (5), f is continuous at t € [a, b] if and only if w¢(t) = 0. Hence, 
t belongs to E if and only if w(t) > 0. Since, by the Archimedean Principle, 


w(t) > 0 if and only if wf(t) = 1/j for some j € N, the result follows 
at once. a 


We need two technical results about the oscillation of f at a point f. 


9.67 Lemma. 
Let f : [a,b] — R be bounded. For each « > 0, the set 


H = {t € [a,b]: wf(t) = ¢} 


is compact. 
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Proof. By definition, H is bounded (by max{|a|, |b|}). Hence, if the lemma is 


false, then H is not closed. Hence, there are points t, € H such that % — f as 
k + o butt ¢ H. Since w¢(t) < e, it follows that there is an ho > 0 such that 


Q(t —ho,t +ho)) <€. (9) 


Since t — t, choose N € N so that 


ho ho 
IN C (t—ho,t +ho). 


Then, by (6), Q2¢((tv — ho/2, tn + ho/2)) < ¢. Therefore, wf (tv) < €, which 
contradicts the fact that ty € H. a 


9.68 Lemma. 

Let I be a closed, bounded interval and f : I — R be bounded. If ¢ > 0 and 
w(t) < € forallt € I, then there is aé& > 0 such that Q (J) < € for all closed 
intervals J C I which satisfy |J| < 6. 


Proof. For each t € I, choose 6; > 0 such that 


Q(t — 6;,¢ + 6;)) < e. (10) 
Since 6,/2 > 0, use the Heine—Borel Theorem to choose f1, ..., ty such that 
N 
Or, 5r, 
I t;-—,t;4+2 
€ U (« pt ) 
j=l 
and set 
Or 
56= mn —. 
1<j<N 2 
If J CJ, then 


br, br, 
JIN dare eases #O 


for some j € {1,..., N}. If J also satisfies |J| < 4, then it follows from 25 < 4;, 
that J C (t; — 6;;, tj + 6:,). In particular, (7) implies 


Qp(T) < Qp (ty — 8, tj +44,)) <e. a 
9.69 Theorem. [LEBESGUE]. 


Let f : [a,b] — R be bounded. Then f is Riemann integrable on [a, b] if 
and only if f is almost everywhere continuous on [a, b]. In particular, if f is 
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bounded and has countably many points of discontinuity on [a, b], then f is 
integrable on [a, b]. 


Proof. Let E be the set of points of discontinuity of f in [a, b]. Suppose that 
f is integrable but E is not of measure zero. By Remarks 9.63 and 9.66, there 
is a jo € N such that 


A:= { € [a,b]: wf (t) = =| 
JO 


is not of measure zero. In particular, there is an ¢9 > O such that if {Jc }xen is 
any collection of intervals which covers H, then 


Co 
Yoel = £0. (11) 
k=l 


Let P = {xo,..., Xn} be a partition of [a, b]. If (xp_-1, x.) N H & G, then, by 
definition, Mz(f) — mg(f) = 1/jo. Hence, 


U(f, P) — L(f, P) = D (Mk f) — me (f)) Axx 


k=1 

Yo MRA) — mf) Axe 
(xK-1, XA) NH AD 
1 


Jo 


IV 


> Axx 


(xK-1, XK) NH AG 


But { [xg—1, xk] : Oce—1, x) A H 4 GO} is a collection of intervals which covers 
H. Hence, it follows from (8) that 


U(f, P) — L(f, P) > - 0: 
0 


Therefore, f cannot be integrable on [a, b]. 
Conversely, suppose that E is of measure zero. Let M = sup, cq) f(«) and 
m = inf, <[a,p] f (x). Given ¢ > 0, choose jo € N such that 


M—-m+b—-a 
Jo 


<é. 
Since E is of measure zero, so is 


H= {! € [a,b]: w(t) = =}. 
JO 
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Hence, by Definition 9.61, there exists a collection of intervals which covers 
H, whose lengths sum to a real number less than 1/(2j9). By expanding these 
intervals slightly, we may suppose that there exist open intervals I, Io, ... 
which cover H such that 


> 


= 1 
Yolhl< =. 
val JO 


Hence, by Lemma 9.67, we can choose N ¢€ N such that {l, h,..., Ivy} 
covers H and 


= 1 
Viisl< a (12) 
v=1 


We must find a partition P such that U(f, P) — L(f, P) < «. The end- 
points of the /,’s form part of this partition. Other points will come from 
further division of that part of [a,b] not covered by the J,’s. Indeed, let 
Ic [a, b]\(UN_, 1). Since the /,’s cover H, w(t) < 1/jo for allt € I’. 
Hence, by Lemma 9.68, there is a 6 > 0 such that if J C I’ satisfies |J| < 4, 
then Q (J) < 1/jo. Subdivide [a, b\w i5) into intervals J;, € = 1,...,5, 
such that |J;| < 6. Then 


1 
Qf(Je) < — (13) 
Jo 


for =1,...,5. 

Let P = {xo, x1,...,%X,} represent the collection of points x such that x is 
an endpoint of some /,, or of some Je. Notice that if (x,_1, x.) NH 4 @, then 
X,—1 and x; are endpoints of some /,,, whence, by (9), 


M— 
Yo MRA) — me (PAE < 


(xx—1,.XK) NH AD 


On the other hand, if (xg-1, x.) N H = @, then x,_, and x, are endpoints of 
some Je, whence, by (10), 


ee b— 
Yo (MCF) — mf) Axe < — Yo Axe = —. 
(x14 )NH=D JO py Jo 
Consequently, 
n M— 7 
U(f, P) — Lf, P) = > (Mf) — me(f)) Axe < a ee 
k=l 


We conclude that f is integrable on [a, b]. a 
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Recall that if a > 0 and f(x) is positive, then 
f*(x):=e* log(f(x)) | 


Suppose that f is Riemann integrable. Although Corollary 5.23 implies that 
f” is integrable for each n € N, we have not yet investigated the integrability 
of noninteger powers of f (e.g., /f and ./f). The following result shows that 
Lebesgue’s Theorem answers the question of integrability for all positive powers 
of f, rational or irrational. 


9.70 Corollary. If f : [a,b] — [0, 00) is Riemann integrable, then so is f® for 
every a > 0. 


In our final application, we use connectivity to characterize the graph of a 
continuous function. 


9.71 Theorem. [CLOSED GRAPH THEOREM]. 
Let I be a closed interval and f : I > R. Then f is continuous on I if and only 
if the graph of f is closed and connected in R?. 


Proof. For any interval J C J, let G(J) represent the graph of y = f(x) for 
x € J. Suppose that f is continuous on J. The function x +~> (x, f(x)) is 
continuous from / into R?, and / is connected in R. Thus G(/) is connected in 
R’ by Theorem 9.55. To prove that G(/) is closed, we shall use Theorem 9.37, 
Let x, € I and (xz, f(x~)) > (x, y) ask > oo. Then x, > x and f(x,) > y, 
as k — oo. Hence, x € J and since f is continuous, f(x.) > f(x). In 
particular, the graph of f is closed. 

Conversely, suppose that the graph of f is closed and connected in R?. 
We first show that f satisfies the Intermediate Value Theorem on /. Indeed, 
suppose to the contrary that there exist xy < x2 in J with f(x) 4 f(x) 
and a value yo between f(x;) and f(x2) such that f(t) ¥ yo for all t € [x1, x2]. 
Suppose for simplicity that f(x) < f (x2). Since f(t) 4 yo for any t € [x1, x2], 
the open sets 


U ={(x, y):x <x1}U{@, y): x < x2, y < yo}, 
V={@, y):x > x} U{@, y):x > x1, y > yo} 


separate G(J), a contradiction. Therefore, f satisfies the Intermediate Value 
Theorem on /. 

If f is not continuous on /, then there exist numbers x9 € J, ¢9 > 0, and 
xx € I such that x, — xo and | f(x.) — f(xo)| > &0. By symmetry, we may 
suppose that f(x.) > f(xo) + €0 for infinitely many k’s, say 


fk) > f(x) + €0 > fo), JEN. 


By the Intermediate Value Theorem, choose c; between Xk; and xg such that 
f(cj) = f(xo) + €0. By construction, (c;, f(cj)) > (xo, f(xo) + €0) and 
cj > xo as j — o. Hence, the graph of f on J is not closed. 2) 
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EXERCISES 


9.8.1. 


9.8.2. 


9.8.3. 


9.8.4. 


9.8.5. 


9.8.6. 


9.8.7. 


Suppose that f; : [a,b] > [0, oo) for k € N and that 
f):=)o fe) 
k=l 


converges pointwise on [a,b]. If f and f, are continuous on [a, b] for 
each k € N, prove that 


b 0 CO Ab 
if Dif) dx = D0] fil) de. 
@ k=l at a 


Let E be closed and bounded in R”. Suppose that g, fx, gx : E — Rare 
continuous on £ with g, > Oand f; > fo... > fe => Ofork EN. If 
& = > p21 gk converges pointwise on E, prove that )°?°., fig, converges 
uniformly on E. 

Suppose that f, f; : R — R are continuous and nonnegative. Prove that 
if f(x) ~ Oasx — too and fy t f everywhere on R, then f, — f 
uniformly on R. 


For each of the following functions, find a formula for w(t). 
1 x€EQ 
vsonfh 22 
1 x>0 
b = es 
) £@) E x <0 
_ Jsin(//x) x #0 
2 rove feat a 
Prove that (1 — x/ kjk > e* uniformly on any closed, bounded subset 


of R. 

Show that if f : [a,b] — R is integrable and g: f([a, b]) > R is contin- 
uous, then g o f is integrable on [a, b]. (Notice by Remark 3.34 that this 
result is false if g is allowed even one point of discontinuity.) 

Using Theorem 7.10 or Theorem 9.30, prove that each of the following 
limits exists. Find a value for the limit in each case. 


k 
[2 
a) Jaa or S88 ao 
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9.8.8. 


9.8.9. 


Convergence in R” 
b) lim f et f as dx where f is continuously differentiable on 
k00°9 2k3 +x y 
[0, 1] and f' (0) > 0. 


logk — x 
lim f°, 2x4 Se eeeeat 
© fim, f 29 cs (SIS 


: 1 a\P 7 
d) tim f') (1+ 7) eax 


a) Prove that for every e« > 0 there is a sequence of open intervals {Ik }xen 
which covers [0, 1] MN Q such that 


[e,2) 


Soll <6. 


k=1 


b) Prove that if {7;},en is a Sequence of open intervals which covers [0, 1], 
then there is an N € N such that 


N 
ese 
k=1 


Let £, be the unit interval [0, 1] with its middle third (1/3, 2/3) removed 
(i.e., Ey = [0, 1/3]U[2/3, 1]). Let Ey be E, with its middle thirds removed; 
that is, 


E> = [0, 1/9] U [2/9, 1/3] U [2/3, 7/9] U [8/9, 1]. 


Continuing in this manner, generate nested sets E, such that each E; is 
the union of 2 closed intervals of length 1/3*. The Cantor set is the set 


CO 
E:= () Ex. 
k=1 


Assume that every point x € [0, 1] has a binary expansion and a ternary 
expansion; that is, there exist az € {0, 1} and by € {0, 1, 2} such that 


(e.g., if x = 1/3, then a2,_; = 0, ao, = 1 for allk and either b; = 1, bk =0 
fork > 1 orb; =O and db; = 1 for all k > 1.) 
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a) Prove that E is a nonempty compact set of measure zero. 

b) Show that a point x € [0, 1] belongs to E if and only if x has a ternary 
expansion whose digits satisfy b, 4 1 for allk EN. 

c) Define f : E — [0, 1] by 


Prove that there is a countable subset Eo of E such that f is 1-1 from 
E\ Ep onto [0, 1] (i.e., prove that E is uncountable). 
d) Extend f from E to [0, 1] by making f constant on the middle thirds 
Ex_-1\Ex. Prove that f : [0, 1] — [0, 1] is continuous and increasing. 
(Note: The function f is almost everywhere constant on [0, 1]; that 
is, constant off a set of measure zero. Yet it begins at f(0) = 0 and 
ends at f(1) = 1.) 


CHAPTER _~ 10 


Metric Spaces 


10.1 


This chapter, an alternative to Chapter 9, covers topological ideas in a metric 
space setting. [f you have already covered Chapter 9, skip this one and proceed 
directly to Chapter 11. 


INTRODUCTION 


The following concept shows up in many parts of analysis. 


10.1 Definition. 


A metric space is a set X together with a function p : X x X — R (called the 
metric of X) which satisfies the following properties for all x, y, z € X: 


POSITIVE DEFINITE (x, y) > O with p(x, y) = Oif and only if x = y, 
SYMMETRIC p(x, y) = p(y, x), 
TRIANGLE INEQUALITY (x, y) < p(x,z) + eZ, y). 


[Notice that by definition, o(x, y) is finite valued for all x, y € X.] 


We are already very familiar with a whole class of metric spaces. 


10.2 EXAMPLE. 


Every Euclidean space R” is a metric space with metric p(x, y) = ||x — y||. 
(We shall call this the usual metric on R". Unless specified otherwise, we shall 
always use the usual metric on R”.) 


Proof. By Theorems 1.7 and 8.6, ¢ is a metric on R”. a 


For the remainder of this chapter, unless otherwise noted, X and Y will denote 
general metric spaces with respective metrics p and t. We shall develop a theory 
of convergence (for both sequences and functions) for arbitrary metric spaces. 
According to Example 10.2, this theory is valid (and will be used by us almost 
exclusively) on R”. Why, then, subject ourselves to such stark generality? Why 
not stick with the concrete Euclidean space case? There are at least three 
answers to these questions: 


1) Economy. You will soon discover that there are many other metric spaces 
which crop up in analysis (e.g., all Hilbert spaces, all normed linear spaces, 


356 
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and many function spaces, including the space of continuous functions on 
a closed, bounded interval). Our general theory of convergence in metric 
spaces will be valid for each of these examples, too. 

Visualization. As we mentioned in Section 1.2, analysis has a strong geomet- 
ric flavor. Working in an abstract metric space only makes that aspect more 
apparent. 

Simplicity. Emphasizing the fact that R” is a metric space strips R of all extra- 
neous details (the field operations, the order relation, decimal expansions) so 
that we can focus our attention on the underlying concept (distance) which 
governs convergence. Mathematics frequently benefits from such abstrac- 
tion. Instead of becoming more difficult, generality actually makes the proofs 
easier to construct. 


2 


wa 


3 


wa 


On the other hand, R? provides a good and sufficiently general model for most 
of the theory of abstract metric spaces (especially convergence of sequences 
and continuity of functions). For this reason, we often draw two-dimensional 
pictures to illustrate ideas and motivate proofs in an arbitrary metric space. 
(For example, see the proof of Remark 10.9 below.) We must not, however, mis- 
lead ourselves by believing that R* provides a complete picture. Metric spaces 
have such simple structure that they can take on many bizarre forms. With that 
in mind, we introduce several more examples. 


10.3 EXAMPLE. 
R is a metric space with metric 


0 x=y 
1 x #y. 


(This metric is called the discrete metric.) 


o(x,y)= 


Proof. The function o is obviously positive definite and symmetric. To prove 
that o satisfies the Triangle Inequality, we consider three cases. If x = z, then 
o(x,y) =0+o0(z, y) = o(x,z) +0, y). A similar equality holds if y = z. 
Finally, if x #zand y 4 z, theno(x, y) <1 <2=o0(4,z)+o0(z, y). | 


Comparing Examples 10.2 and 10.3, we see that a given set can have more 
than one metric. Hence, to describe a particular metric space, we must specify 
both the set X and the metric p. 


10.4 EXAMPLE. 


If E C X, then E is a metric space with metric p. (We shall call such metric 
spaces E subspaces of X.) 


Proof. If the Positive Definite Property, the Symmetric Property, and the 
Triangle Inequality hold for allx, y ¢ X, then they holdforallx,yeF. 


A particular example of a subspace is provided by the set of rationals in R. 
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10.5 EXAMPLE. 


Q is a metric space with metric p(x, y) = |x — yl. 
Metric spaces are by no means confined to numbers and vectors. Here is an 
important metric space whose “points” are functions. 


10.6 EXAMPLE. 


Let C[a, b] represent the collection of continuous f : [a, b] > Rand 


fll = sup |f(@). 


x€[a,b] 
Then p(f, g) := || f — gl is a metric on C[a, D]. 


Proof. By the Extreme Value Theorem, || || is finite for each f € C[a, b]. 
By definition, || f|| > 0 for all f, and || f|| = 0 if and only if f(x) = 0 for 
every x € [a, b]. Thus p is positive definite. Since p is obviously symmetric, it 
remains to verify the Triangle Inequality. But 


If +gll= sup |f@)+gsQ@)|< sup [f@+ sup [g@)|=Ifll+Isl. a 


xé[a,b] x€[a,b] x€[a,b] 


It is interesting to note that convergence in this metric space means uniform 
convergence (see Exercise 10.1.8). 

There are two ways to generalize open and closed intervals to arbitrary metric 
spaces. One way is to use the metric directly as follows. 


10.7 Definition. 


Let a € X andr > 0. The open ball (in X) with center a and radius r is the set 
B,(a) := {x € X: p(x, a) <7}, 
and the closed ball (in X) with center a and radius r is the set 


{x € X: p(x, a) <r}. 


Notice by Theorem 1.6 that in R (with the usual metric), the open ball 
(respectively, the closed ball) centered at a of radius r is (a — r,a +r) (respec- 
tively, [a—r, a+r]) (i.e., open balls are open intervals and closed balls are closed 
intervals). With respect to the discrete metric, however, balls look quite differ- 
ent. For example, for each 0 < r < 1 and each a in the discrete metric space, the 
closed and open balls centered at a of radius r are both equal to {a}. 

Another way to generalize open and closed intervals to X is to specify what 
open and closed mean. Notice that every point x in an open interval J is sur- 
rounded by points in J. The same property holds for complements of closed 
intervals. This leads us to the following definition. 
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10.8 Definition. 


i) A set V C X is said to be open if and only if for every x € V there is an 
é > Osuch that the open ball B; (x) is contained in V. 
ii) Aset E C X is said to be closed if and only if E° := X \ E is open. 


Our first result about these concepts shows that they are consistent as applied 
to balls. 


10.9 Remark. Every open ball is open, and every closed ball is closed. 


Proof. Let B,(a) be an open ball. By definition, we must prove that given 
x € B,(a) there is an e > O such that B,(x) C B,(a). Let x € B,(a) and set 
€ =r — p(x,a). (Look at Figure 9.1 to see why this choice of ¢ should work.) 
If y € B,(x), then by the Triangle Inequality, assumption, and the choice of e, 


p(y, a) < p(x) + p(x, a) < e+ p(x%,a) =r. 


Thus, by Definition 10.7, y € B,(a). In particular, B;(x) C B,(a). Similarly, 
we can show that {x € X : p(x, a) > r} is also open. Hence, every closed ball 
is closed. | 


Here are more examples of open sets and closed sets. 
10.10 Remark. [fa € X, then X \ {a} is open and {a} is closed. 


Proof. By Definition 10.8, it suffices to prove that the complement of 
every singleton E := {a} is open. Let x € E° and set ¢ = p(x,a). Then, by 
Definition 10.7, a ¢ B,(x), so Be(x) C E*. Therefore, E° is open by 
Definition 10.8. a 


Students sometimes mistakenly believe that every set is either open or closed. 
Some sets are neither open nor closed (like the interval [0, 1)). And, as the 
following result shows, every metric space contains two special sets which are 
both open and closed. 


10.11 Remark. Jn an arbitrary metric space, the empty set 6 and the whole space 
X are both open and closed. 


Proof. Since X = 0° and @ = X°, it suffices by Definition 10.8 to prove that 4 
and X are both open. Because the empty set contains no points, “every” point 
x € OG satisfies B,(x) C B. (This is called the vacuous implication.) Therefore, 
@ is open. On the other hand, since B,(x) C X for all x € X and alle > 0, it is 
clear that X is open. 2) 


For some metric spaces (like R”), these are the only two sets which are simul- 
taneously open and closed. For other metric spaces, there are many such sets. 
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10.122 EXAMPLE. 


Every subset of the discrete space R is both open and closed. 


Proof. It suffices to prove that every subset of R is open (with respect to the 
discrete metric). Let E C R. By Remark 10.11, we may assume that EF is 
nonempty. Let a € E. Since B,(a) = {a}, some open ball containing a is a 
subset of E. By Definition 10.8, E is open. a 


To see how these concepts are connected with limits, we examine convergence 
of sequences in an arbitrary metric space. Using the analogy between the met- 
ric p and the absolute value, we can transfer much of the theory of limits of 
sequences from R to any metric space. Here are the basic definitions. 


10.13 Definition. 
Let {x,} be a sequence in X. 


i) {x,} converges (in X) if there is a point a € X (called the /imit of x,) such 
that for every ¢ > 0 there is an N € N such that 


n>WN implies p(xn,a) <. 
ii) {x,}is Cauchy if for every ¢ > 0 there is an N € N such that 
n,m>N implies p(%,xXm) < &. 


iii) {x,} is bounded if there is an M > 0 andab € X such that p(x,,b) < M 
for alln EN. 


Modifying the proofs in Chapter 2, by doing little more than replacing |x — y| 
by (x, y), we can establish the following result. 


10.14 Theorem. Let X be a metric space. 


i) A sequence in X can have at most one limit. 
ii) If x, € X converges to a and {xn,} is any subsequence of {Xn}, then Xn, con- 
verges toaask > o. 
iii) Every convergent sequence in X is bounded. 
iv) Every convergent sequence in X is Cauchy. 


The following result shows that, by using open sets, we can describe conver- 
gence of sequences in an arbitrary metric space without reference to the dis- 
tance function. Later in this chapter, we shall use this point of view to great 
advantage. 


10.15 Remark. Let x, € X. Then x, > aasn — oo if and only if for every 
open set V which contains a there isan N € N such that n > N implies x, € V. 
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Proof. Suppose that x, — a, and let V be an open set which contains a. By 
Definition 10.8, there is an e > O such that B,(a) C V. Given this ¢, use 
Definition 10.13 to choose an N € N such that n > N implies x, € B;(a). By 
the choice of ¢, x, € V foralln > N. 

Conversely, let ¢ > 0 and set V = B,(a). Then V is an open set which 
contains a; hence, by hypothesis, there is an N € N such that n > N implies 
Xn € V. In particular, p(x, a) < eforalln > N. |_| 


The following result, which we shall use many times, shows that convergent 
sequences can also be used to characterize closed sets. 


10.16 Theorem. Let E C X. Then E is closed if and only if the limit of every 
convergent sequence xx € E satisfies 


lim x, € E. 


=> 0o 


Proof. The theorem is vacuously satisfied if E is the empty set. 

Suppose that E #4 9 is closed but some sequence x, € E converges to a 
point x € E*. Since E is closed, E° is open. Thus, by Remark 10.15, there is 
an N € Nsuch that n > N implies x, € E°, a contradiction. 

Conversely, suppose that E is a nonempty set such that every convergent 
sequence in £ has its limit in E. If E is not closed, then, by Remark 10.11, 
E # X, and, by definition, E° is nonempty and not open. Thus, there is 
at least one point x € E* such that no ball B,(x) is contained in E°. Let 
xe € Byx(x) OE fork = 1,2,.... Then x, € E and p(xx,x) < 1/k for all 
k EN. Now 1/k — Oask —> «&, so it follows from the Squeeze Theorem 
(these are real sequences) that p(x%,x) > Oask > o (ie, x, > x as 
k — oo). Thus, by hypothesis, x € E, a contradiction. | 


Notice that the Bolzano—Weierstrass Theorem and Cauchy’s Theorem are 
missing from Theorem 10.14. There is a simple reason for this. As the next 
two remarks show, neither of these results holds in an arbitrary metric space. 


10.17 Remark. The discrete space contains bounded sequences which have no 
convergent subsequences. 


Proof. Let X = R be the discrete metric space introduced in Example 10.3. 
Since o(0,k) = 1 for all k € N, {k} is a bounded sequence in X. Suppose 
that there exist integers k} < ky < ... and an x ¢€ X such thatk; > x as 
j — oo. Then there is an N € N such that o(kj,x) < 1 for j > N (ie, 
k; = x for all j > N). This contradiction proves that {k} has no convergent 
subsequences. | 


10.18 Remark. The metric space X = Q, introduced in Example 10.5, contains 
Cauchy sequences which do not converge. 
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Proof. Choose (by the Density of Rationals) points gq, € Q such that 
gk — V2. Then {qx} is Cauchy (by Theorem 10.14iv) but does not converge 
in X since /2 ¢ X. |_| 


This leads us to the following concept. 


10.19 Definition. 


A metric space X is said to be complete if and only if every Cauchy sequence 
Xn € X converges to some point in X. 


At this point, you should read Section 9.5 to see how these concepts play 
out in the concrete Euclidean space setting. Notice by Theorem 9.35 that R” 
is complete for all n € N. What can be said about complete metric spaces in 
general? 


10.20 Remark. By Definition 10.19, a complete metric space X satisfies two 
properties: 1) Every Cauchy sequence in X converges; 2) the limit of every Cauchy 
sequence in X stays in X. 


Property 2), by Theorem 10.16, means that X is closed. Hence, it is natural 
to ask, Is there a simple relationship between complete subspaces and closed 
subsets? 


10.21 Theorem. Let X be a complete metric space and E be a subset of X. Then 
E (as a subspace) is complete if and only if E (as a subset) is closed. 


Proof. Suppose that E is complete and that x, € E converges. By The- 
orem 10.14iv, {x,} is Cauchy. Since FE is complete, it follows from Defini- 
tion 10.19 that the limit of {x,} belongs to E. Thus, by Theorem 10.16, E is 
closed. 

Conversely, suppose that E is closed and that x, € E is Cauchy in E. Since 
the metrics on X and E are identical, {x,,} is Cauchy in X. Since X is complete, 
it follows that x, > x,asn — oo, for some x € X. But E is closed, so x must 
belong to E. Thus E is complete by definition. a 


EXERCISES 


10.1.1. Ifa, b € X and p(a, b) < « for all ¢ > 0, prove that a = b. 
10.1.2. Prove that {x;,} is bounded in X if and only if supyen p(xK, a) < 00 for 
allae X. 


10.1.3. Let R” be endowed with the usual metric and suppose that {x;} is a 


sequence in R” with components ay ): that is, 


xe = (26, 2,2), 


10.1.4. 


10.1.5. 


10.1.6. 


10.1.7. 
10.1.8. 


10.1.9. 


10.1.10. 
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a) Use Remark 8.7 to prove that {x;} is bounded in R” if and only if 
there is a C > O such that [| < C for allk € N and all j ¢€ 
{1,2,...,n}. 

b) Leta € R”. Prove that x; — aasn — oo if and only if x 
k — oo, for every j € {1,2,...,n}. 

c) Find the limit of each of the following sequences. 


e+] Vk +2 
(.e'"") Hao ee | pee ace 
1—-k Vie —k +1 


kk N* 1 /4Fa 
EEL) USE SR a1 


a) Leta € X. Prove that if x, = a for every n € N, then x, converges. 
What does it converge to? 

b) Let X = R with the discrete metric. Prove that x, > a asn > ooif 
and only if x, = a for large n. 


? — dk, as 


a) Let {x,} and {y,} be sequences in X which converge to the same 
point. Prove that p(x, yn) > O0asn > oo. 
b) Show that the converse of part a) is false. 


Let {x,} be Cauchy in X. Prove that {x,} converges if and only if at least 
one of its subsequences converges. 

Prove that the discrete space R is complete. 

a) Prove that the metric space C[a, b] in Example 10.6 is complete. 


b) Let If ll =f’ |f @)|dx and define 


dist(f, g) := If — gl 


for each pair f, g € C[a, b]. Prove that this distance function also 
makes C[a, b] a metric space. 
c) Prove that the metric space C[a, b] defined in part b) is not complete. 


a) Show that if x € B,(a), then there is an ¢ > O such that the closed 
ball centered at x of radius « is a subset of B,(a). 

b) Ifa 4 b are distinct points in X, prove that there is anr > 0 such 
that B,(a) N B,(b) = @. 

c) Show that given two balls B,(a) and B;(b), and a point x € B,(a)N 
B;(b), there are radii c and d such that 


Be(x) © B-(a) 0 Bs(b) and Ba(x) 2 B-(a) U Bs(). 


a) A subset E of X is said to be sequentially compact if and only if 
every sequence x, € E has a convergent subsequence whose limit 
belongs to E. Prove that every sequentially compact set is closed 
and bounded. 
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b) Prove that R is closed but not sequentially compact. 
c) Prove that every closed bounded subset of R is sequentially 
compact. 


10.1.12. Prove Theorem 10.14. 


10.2 INTERIOR, CLOSURE, AND BOUNDARY 


Thus far, we have used open and closed mostly for identification. At this point, 
we begin to examine these concepts in more depth. Our first result shows that 
open sets and closed sets behave very differently with respect to unions and 
intersections. 


10.22 Theorem. Let X be a metric space. 


i) If {Va}aea is any collection of open sets in X, then 


Uv 


acA 


is open. 
ii) Uf {Ve :k =1,2,...,n} is a finite collection of open sets in X, then 


n 
() Vix () Vie 
k=1 ke{l,2,...,n} 


is open. 
iti) If {Ew}aea is any collection of closed sets in X, then 


al 


acA 


is closed. 
iv) If{E,:k =1,2,...,n} is a finite collection of closed sets in X, then 


n 
(jag LR 
k=1 ke{1,2,...,2} 


is closed. 
v) If Vis open in X and E is closed in X, then V \ E is open and E \ V is closed. 


Proof. i) Let x € Uge, Vo. Then x € Vy for some a € A. Since Vy is open, 
it follows that there is anr > 0 such that B,(x) C Vy. Thus B,(x) © Ugeg Va 
(i.e., this union is open). 

ii) Let x € (Vp) Ve. Then x € VY fork = 1,2,...,n. Since each Vy is 
open, it follows that there are numbers rz, > 0 such that B,,(x) C Vy. Let 
r=min{r),...,7r,}. Thenr > 0 and B,(x) C V; for allk = 1, 2,...,n; that is, 
B(x) © ae | Vk. Hence, this intersection is open. 
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iii) By DeMorgan’s Law (Theorem 1.36) and part i), 


(Me) =Uss 


acA acA 


is open, SO (<4 Eq is closed. 
iv) By DeMorgan’s Law and part ii), 


n c n 
(Us) =()# 
k=1 k=1 


is open, so Ur_, Ex is closed. 
v) Since V\ E = VE‘ and E\ V = ENV‘, the former is open by part 
ii), and the latter is closed by part iii). | 


The finiteness hypothesis in Theorem 10.22 is critical, even for the case X = R. 


10.23 Remark. Statements ii) and iv) of Theorem 10.22 are false if arbitrary col- 
lections are used in place of finite collections. 


Proof. In the metric space X = R, 


is closed and 


is open. | 


Theorem 10.22 has many applications. Our first application is that every set 
contains a largest open set and is contained in a smallest closed set. To facilitate 
our discussion, we introduce the following topological operations. 


10.24 Definition. 
Let E be a subset of a metric space X. 


i) The interior of E is the set 


E° =v : V C E and V is open in X}. 
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ii) The closure of E is the set 


E:=( \{B : B D E and B is closed in X}. 


Notice that every set E contains the open set and is contained in the closed 
set X. Hence, the sets E° and E are well defined. Also notice, by Theorem 10.22, 
that the interior of a set is always open and the closure of a set is always closed. 

The following result shows that E° is the largest open set contained in F, and 
E is the smallest closed set which contains E. 


10.25 Theorem. Let E C X. Then 


lb? Cre F, 
ii) if Vis openand V C E, then V © E®, and 
iti) if Cis closed and C D E, thenC D E. 


Proof. Since every open set V in the union defining E° is a subset of E, it is 
clear that the union of these V’s is a subset of E. Thus E° C E. A similar 
argument establishes E © E. This proves i). 

By Definition 10.24, if V is an open subset of E, then V C E° and if Cisa 
closed set containing FE, then E C C. This proves ii) and iii). a 


In particular, the interior of a bounded interval with endpoints a and b is 
(a,b), and its closure is [a,b]. In fact, it is evident by parts ii) and iii) that 
E = E° if and only if E is open and E = E if and only if E is closed. We shall 
use this observation many times below. 

The following examples illustrate the fact that the interior of a nice enough 
set E in R? can be obtained by removing all its “edges,” and the closure of E by 
adding all its “edges.” 


10.26 EXAMPLE. 


Find the interior and closure of the set E = {(x, y): -—1 < x < 1 and —|x| < 
y < |x]. 

Solution. Graph y = |x| and x = +1, and observe that EF is a bow-tie-shaped 
region with “solid” vertical edges (see Figure 9.2). Now, by Definition 10.8, any 


open set in R* must contain a disk around each of its points. Since E? is the 
largest open set inside E, it is clear that 


E° = {(x,y): -l <x <1land — |x| < y < |x]}. 


Similarly, 


E={(,y):-l <x <1land — |x| <y < |x}}. a 
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10.27 EXAMPLE. 


Find the interior and closure of the set E = B,(—2, 0) U B) (2, 0) U{(x, 0): -1 < 
x <1}. 


Solution. Draw a graph of this region. It turns out to be “dumbbell shaped”: 
two open disks joined by a straight line. Thus E° = B,(—2, 0) U Bi (2, 0) and 


E = Bi(—2,0) UB, 0) U{(x, 0): -1 <x <1}. | 


One of the most important results from one-dimensional calculus is the 
Fundamental Theorem of Calculus. It states that the behavior of a derivative 
f’ on an interval [a, b], as measured by the integral, is completely determined 
by the values of f at the endpoints of [a, b]. What shall we use for “endpoints” 
of an arbitrary set in X? Notice that the endpoints a, b are the only points which 
lie near both [a, b] and the complement of [a, b]. Using this as a cue, we intro- 
duce the following concept. 


10.28 Definition. 
Let E C X. The boundary of E is the set 


dE :={x €X: forallr>0, B-(x)NE A@and B(x) N ES FG}. 


[We will refer to the last two conditions in the definition of 0 £ by saying B, (x) 
intersects E and E°.| 


10.29 EXAMPLE. 
Describe the boundary of the set 


E ={(x, y):x* + y? <9 and (x — 1)(y +2) > O}. 


Solution. Graph the relations x? 4+ y? = 9 and (x — 1)(y + 2) = 0 to obtain a 
region with solid curved edges and dotted straight edges (see Figure 9.3). By 
definition, then, the boundary of E is the union of these curved and straight 
edges (all made solid). Rather than describing dE analytically (which would 
involve solving for the intersection points of the straight lines x = 1, y = —2, 
and the circle x* + y* = 9), it is easier to describe 4 E by using set algebra. 


JE = {(x, y) 1x7 +y? <9 and (x — 1)(y +2) = 0} 
\ {(x, y) x7 + y? <9 and (x — 1)(y +2) > 0} a 


It turns out that set algebra can be used to describe the boundary of any set. 
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10.30 Theorem. Let E C X. Then dE = E\E°. 
Proof. By Definition 10.28, it suffices to show 


x € E if and only if B,(x) NE £9 for allr > 0, and (1) 
x ¢ E° if and only if B(x) N E® 4G for allr > 0. (2) 


We will provide the details for (2) and leave the proof of (3) as an exercise. 
Suppose that x € E but B,,.(x) ON E = @ for some ro > 0. Then (B,,(x))° is 
a closed set which contains E; hence, by Theorem 10.25iii, E C (B,)(x))°. It 
follows that E 9 B,,.(x) = @ (e.g., x ¢ E, a contradiction). 

Conversely, suppose that x ¢ E. Since (E)° is open, there is an 9 > 0 
such that B,,.(x) © (£)*. In particular, 6 = B,.(x) NE D> B,.(x) NE for 
some ro > 0. | 


We have introduced topological operations (interior, closure, and boundary). 
The following result answers the question, How do these operations interact 
with the set operations (union and intersection)? 


10.31 Theorem. Let A, B € X. Then 


i) (AUB)? D A°U B®, (ANB)? = APN B®, 


ii) AUB=AUB, ANBCANB, 


iii) (AUB) C9AUOB, and a(ANB) C(ANAB)U(BNA|A)U(GANAB). 


Proof. i) Since the union of two open sets is open, A° U B® is an open subset 
of A U B. Hence, by Theorem 10.2511, A° U B? C (AU B)?. 

Similarly, (ANB)? > A°N B°. On the other hand, ifV C ANB,thenV CA 
and V c B. Thus, (AM B)? € A°N B?. 

ii) Since A U B is closed and contains A U B, it is clear that by Theo- 
rem 10.25iii), AU B C AUB. Similarly, AN B C AN B. To prove the reverse 
inequality for union, suppose that x ¢ AUB. Then there is a closed set E 
which contains A U B such that x ¢ E. Since E contains both A and B, it 
follows that x ¢ A and x ¢ B. This proves part ii). 

iii) Let x € 0(A U B); that is, suppose that B,(x) intersects both A U B and 
(AUB) for allr > 0. Since (AU B)*° = ASN B*, it follows that B,(x) intersects 
both A‘ and B* for all r > 0. Thus, B,(x) intersects A and A° for all r > 0, or 
B,(x) intersects B and B¢ for allr > 0 (.e.,x € dAUOB). This proves the first 
set inequality in part iii). 

To prove the second set inequality, fix x € (AN B) [i.e., suppose that B,. (x) 
intersects AM B and (AN B)* for allr > O]. If x € (AN |B) U(BN OA), 
then there is nothing to prove. If x ¢ (AM 0B) U (BN OA), then 
x € (ASU(0B)°)N (BS U(0A)*). Hence, it remains to prove that x € A°U(dB)*° 
implies x € 0A and x € B® U (0A)*° implies x € 0B. By symmetry, we need 
only prove the first implication. 

Case 1. x € A°. Since B;(x) intersects A, it follows that x € dA. 
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Case 2. x € (dB)°. Since B;(x) intersects B, it follows that B,-(x) C B for 
small r > 0. Since B,(x) also intersects A° U B°, it must be the case that B,(x) 


intersects A‘. In particular, x € 0A. a 
EXERCISES 
10.2.1. Find the interior, closure, and boundary of each of the following sub- 


10.2.2. 


10.2.3. 


10.2.4. 


10.2.5 |. 


10.2.6. 
10.2.7. 


10.2.8 |. 


10.2.9. 


sets of R. 
a) E={1/n:neéeN} 
1 1 
b) E=U", | —.- 
c) E=U(-n,n) 
d) E=Q 
Identify which of the following sets are open, which are closed, and 
which are neither. Find E°, E, and 0£ and sketch E in each case. 


a) E = {(x, y) : 2x? + 6y? < 3} 

b) E={(x,y):x*+4x + y? =5}U {(x, 0): x €[7, 9} 
c) B=(Gi yiiy = 8x" 0 S'y= 2} 

d) E={(x,y):x*-y*<4,-2<y <2} 


Leta eX, s<r, 
V={xeX:s < p(x,a) <r}, and E={xeX:s < o(x,a) <r}. 


Prove that V isopenand E isclosed. __ 
Suppose that A C B C X. Prove that A C B and A° C B?. 


This exercise is used in Section 10.5. Show that if E is nonempty and 
closed in X anda ¢ E, then infyez p(x, a) > 0. 

Prove (3). 

Show that Theorem 10.31 is best possible in the following sense. 


a) There exist sets A, B in R such that (A U B)? 4 A° U B®. 

b) There exist sets A, B in Rsuch that ANB A ANB. 

c) There exist sets A, B in R such that 0(A U B) #4 0A U OB and 
d(AN B) 4 (ANIAB)U(BN|DA)U(MAN AB). 


This exercise is used many times from Section 10.5 onward. Let Y be a 
subspace of X. 


a) Show that a set V is open in Y if and only if there is an open set U 
in X such that V =UNY. 

b) Show that a set E is closed in Y if and only if there is a closed set A 
in X suchthatE = ANY. 


Let f :R — R. Prove that f is continuous on R if and only if f~!(/) 
is open in R for every open interval J. 
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10.2.10. Let V be a subset of X. 


a) Prove that V is open in X if and only if there is a collection of open 
balls {By : a € A} such that 


= |) sBy: 
acA 


b) What happens to this result if open is replaced by closed? 
10.2.11. Let E C X be closed. 


a) Prove that dE C E. 
b) Prove that dE = E if and only if E° = 9%. 
c) Show that b) is false if E is not closed. 


10.3 COMPACT SETS 


In Chapter 3 we proved the Extreme Value Theorem for functions defined on 
R. In this section we shall extend that result to functions defined on an arbi- 
trary metric space. To replace the hypothesis “closed, bounded interval” used in 
the real case, we introduce “compactness,” a concept which gives us a powerful 
tool for extending local results to global ones (see especially Remark 10.35 and 
Theorems 10.43, 10.69, and 12.46). 

Since compactness of E depends on how E can be “covered” by a collection 
of open sets, we begin by introducing the following terminology. 


10.32 Definition. 


Let V = {Vu}aea be a collection of subsets of a metric space X and suppose 
that E is a subset of X. 


i) V is said to cover E (or be a covering of E) if and only if 


Ee |) | Ve, 


acA 


ii) V is said to be an open covering of EF if and only if V covers EF and each Vy 
is open. 

iti) Let V be a covering of E. V is said to have a finite (respectively, countable) 
subcovering if and only if there is a finite (respectively, countable) subset 
Ao of A such that {Vo}aedy covers E. 


Notice that the collections of open intervals 


levee ™ Cee) 
—, -—_ and —-, —— 
k+l k+1/ Jpen kok keN 


are open coverings of the interval (0, 1). The first covering of (0, 1) has no finite 
subcover, but any member of the second covering covers (0, 1). Thus an open 
covering of an arbitrary set may or may not have a finite subcovering. 
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Sets that always have finite subcoverings are important enough to be given 
a name. 


10.33 Definition. 


A subset H of a metric space X is said to be compact if and only if every open 
covering of H has a finite subcover. 


To get a feeling for what this definition means, we make some elementary 
observations concerning compact sets in general. 


10.34 Remark. The empty set and all finite subsets of a metric space are compact. 


Proof. These statements follow immediately from Definition 10.33. The 
empty set needs no set to cover it, and any finite set H can be covered by 
finitely many sets, one set for each element in H. a 


Since the empty set and finite sets are also closed, it is natural to ask whether 
there is a relationship between compact sets and closed sets in general. The 
following three results address this question in an arbitrary metric space. 


10.35 Remark. A compact set is always closed. 


Proof. Suppose that H is compact but not closed. Then H is nonempty and 
(by Theorem 10.16) there is a convergent sequence x, € H whose limit x does 
not belong to H. For each y € H, set r(y) := p(x, y)/2. Since x does not 
belong to H, r(y) > 0; hence, each B,(,)(y) is open and contains y; that is, 
{B-()(y) : y € H} is an open covering of H. Since H is compact, we can 
choose points y; and radii rj := r(y;) such that {B,,(yj) : j = 1,2,...,N} 
covers H. 

Setr := min{r),..., ry}. (This is a finite set of positive numbers, so r is also 
positive.) Since x, > x ask — ov, x, € B,(x) for large k. But x, € B(x) NH 
implies xz € B, (yj) for some j € N. Therefore, it follows from the choices of 
r; and r, and from the Triangle Inequality, that 


rj = P(x, yj) > p(x, yp — P(x, X) 
= 2rj — p(xn,x) > 2rj —r = rj —rj =N;, 


a contradiction. |_| 


The following result is a partial converse of Remark 10.35 (see also Theo- 
rem 10.41 below). 


10.36 Remark. A closed subset of a compact set is compact. 


Proof. Let E be a closed subset of H, where H is compact in X and suppose 
that V = {Va}wea iS an open covering of E. Now E° = X\E is open; hence, 
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Y U {E*‘} is an open covering of H. Since H is compact, there is a finite set 
Ao © A such that 


HCECU L Va 


aecAg 


But EN E° = Y. Therefore, E is covered by {Va}weAg: 1) 
Here is the connection between closed, bounded sets and compact sets. 


10.37 Theorem. Let H be a subset of a metric space X. If H is compact, then H 
is closed and bounded. 


Proof. Suppose that H is compact. By Remark 10.35, H is closed. It is also 
bounded. Indeed, fix b € X and observe that {B,(b) : n € N} covers X. Since 
H is compact, it follows that 


N 
HC |J Bald) 


n=1 


for some N EN. Since these balls are nested, we conclude that H C By(b) 
(i.e., H is bounded). a 


10.38 Remark. The converse of Theorem 10.37 is false for arbitrary metric 
spaces. 


Proof. Let X = R be the discrete metric space introduced in Example 10.3. 
Since o(0,x) < 1 for all x € R, every subset of X is bounded. Since x, > x 
in X implies x, = x for large k, every subset of X is closed. Thus [0, 1] is a 
closed, bounded subset of X. Since {x},<jo,1] is an uncountable open covering 
of [0, 1], which has no finite subcover, we conclude that [0, 1] is closed and 
bounded, but not compact. a 


The problem here is that the discrete space has too many open sets. To iden- 
tify a large class of metric spaces for which the converse of Theorem 10.37 
DOES hold, we need a property which cuts the “number of essential” open 
sets down to a reasonable size. 


10.39 Definition. 


A metric space X is said to be separable if and only if it contains a countable 
dense subset (i.e., if and only if there is a countable set Z of X such that for 
every point a € X there is a sequence x, € Z such that x, > a ask —> oo). 
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It is easy to see (Theorem 9.32) that all Euclidean spaces are separable. The 
space C[a, b] is also separable (see Exercise 10.7.1). Hence, the hypothesis of 
separability is not an unusual requirement. 

The following result makes clear what we meant above by “number of essen- 
tial” open sets. It shows that every open covering of a set in a separable metric 
space has a countable subcovering. 


10.40 Theorem. [LINDELOF]. 

Let E be a subset of a separable metric space X. If {Vu}aca is a collection of 
open sets and E © UweA Va, then there is a countable subset {a, 02,...} of A 
such that 


CO 
Be |) Va 
k=1 


Proof. Let Z be a countable dense subset of X, and consider the collection T 
of open balls with centers in Z and rational radii. This collection is countable. 
Moreover, it “approximates” all other open sets in the following sense: 

Cia: Given any open ball B(x) C X, there is a ball By (a) € T such that 
x € By(a) and By(a) © B;(x). 

PRroor oF CLaAim: Let B(x) C X be given. By Definition 10.39, choose 
a € Z such that p(x, a) < r/4, and choose by Theorem 1.18 a rational g € Q 
such that r/4 < q < r/2. Since r/4 < q, we have x € B,(a). Moreover, if 
y € B, (a), then 


r r r 
PS) SPO OSES Ar Se ee 


Therefore, B,(a) C B,(x). This establishes the claim. 
To prove the theorem, let x ¢ E. By hypothesis, x € Vy for some a € A. 
Hence, by the claim, there is a ball B, € T such that 


WEB, C Vy: (3) 
The collection TJ is countable; hence, so is the subcollection 

(U1, U2,...}:= {By ix € E}. (4) 

By (4), for each k € N there is at least one a; € A such that Ux C Vo,. Hence, 


by (5), 


[e,2) 


CO 
EC(JB=WuUCL Ve. a 
k=1 


xeE k=1 


We are prepared to obtain a converse of Theorem 10.37. (For the definition 
of the Bolzano—Weierstrass Property, see Exercise 10.5.9.) 
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10.41 Theorem. [HEINE-BOREL]. 

Let X be a separable metric space which satisfies the Bolzano—Weierstrass 
Property and H be a subset of X. Then H is compact if and only if it is closed 
and bounded. 


Proof. By Theorem 10.37, every compact set is closed and bounded. 

Conversely, suppose to the contrary that H is closed and bounded but not 
compact. Let V be an open covering of H which has no finite subcover of H. 
By Lindel6f’s Theorem, we may suppose that V = {Vx}xen; that is, 


Hc U Vi. (5) 


keEeN 


By the choice of Y, Ut_sVj cannot contain H for any k € N. Thus we can 
choose a point 


k 
xe H\)V; (6) 


j=l 


for each k € N. Since H is bounded, the sequence x, is bounded. Hence, 
by the Bolzano—Weierstrass Property, there is a subsequence x, which con- 
verges to some x as v > oo. Since H is closed, x € H. Hence, by (6), x € Vy 
for some N €N. But Vy is open; hence, there is an M € N such that v > M 
implies k, > N and x;, € Vy. This contradicts (7). We conclude that H is 
compact. a 


Since R” satisfies the hypotheses of Theorem 10.41 (see Theorems 9.32 and 
9.34), it follows that a subset of a Euclidean space is compact if and only if it is 
closed and bounded. 

We now turn our attention to uniform continuity on an arbitrary metric 
space. 


10.42 Definition. 


Let X be a metric space, E be a nonempty subset of X, and f : E > Y. Then 
f is said to be uniformly continuous on E (notation: f : E > Y is uniformly 
continuous) if and only if given e > 0 there is a 5 > 0 such that 


p(x,a)<6 and x,aeE imply t(f(x), f@) <e. 


In the real case, we proved that uniform continuity and continuity were equiv- 
alent on closed, bounded intervals. That result, whose proof relied on the 
Bolzano—Weierstrass Theorem, is not true in an arbitrary metric space. If we 
strengthen the hypothesis from closed and bounded to compact, however, the 
result is valid for any metric space. 
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10.43 Theorem. Suppose that E is a compact subset of X and that f : X — Y. 
Then f is uniformly continuous on E if and only if f is continuous on E. 


Proof. If f is uniformly continuous on a set, then it is continuous whether or 
not the set is compact. 

Conversely, suppose that f is continuous on E. Givene > Oanda é€ E, 
choose 6(a) > 0 such that 


x €Byq(a) and x€E imply rt(f(x), f(a) < > 


Since a € B;(a) for all 5 > 0, it is clear that {Bs(q)/2(a) : a € E} is an open 
covering of E. Since E is compact, choose finitely many points aj € E and 
numbers 46; := 6(a;) such that 


N 
EC |) Bs,2(a;). (7) 


Set 6 := min{d,/2,..., dy /2}. 
Suppose that x,a € E with p(x,a) < 5. By (8), x belongs to Bs, /2(aj) for 
some 1 < j < N. Hence, , 


On 8) 
pla, aj) < o(a, x) + p(x, aj) < a =e os = 8;3 
that is, a also belongs to B3,(a;). It follows, therefore, from the choice of 6; 
that 


t(f(x), fla) < t(f (x), f(aj)) + tf (aj), F(@) < > - 5 =e. 


This proves that f is uniformly continuous on E. >) 


EXERCISES 


10.3.1. Identify which of the following sets are compact and which are not. If 
E is not compact, find the smallest compact set H (if there is one) such 
that E CH. 


a) {k/(k+2):k EN}U {I} 

b) {(x, y) € R?:a < x3 + y* < 5} for real numbers 0 < a <b 
c) {(x, y) € R* : y =cos(1/x) andx € (0, 1}} 

d) {(x, y) € R®: |xy| < 4} 


10.3.2. Let A, B be compact subsets of X. Prove that A U B and AN B are 
compact. 

10.3.3. Suppose that E C R is compact and nonempty. Prove that 
sup E, inf FE € E. 
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10.3.4. 


10.3.5. 


10.3.6. 


10.3.7. 


10.3.8. 


10.3.9. 


10.3.10. 


Metric Spaces 


Suppose that {Vo}vea is a collection of nonempty open sets in X which 
satisfies Vy 1 Vg = @ for alla ¢ 6 in A. Prove that if X is separable, 
then A is countable. What happens to this result when open is omitted? 
Prove that if V is open in a separable metric space X, then there are 
open balls B,, Bo, ... such that 


Vi |B 


JEN 


Prove that every open set in R is a countable union of open intervals. 
Suppose that X is a separable metric space which satisfies the Bolzano-— 
Weierstrass Property, that Y is a complete metric space, and that E is 
a bounded subset of X. Prove that a function f : E — Y is uniformly 
continuous on E if and only if f can be continuously extended to E; 
that is, if and only if there exists a continuous function g : E — Y such 
that f(x) = g(x) forallx € E. 

Suppose that X satisfies the Bolzano—Weierstrass Property and that A 
and B are compact subsets of X. Prove that if AM B = @ and if 


dist (A, B) :=inf{o(x,y):x €A and ye B}, 


then dist (A, B) > 0. Show that even in the space R’, there exist subsets 
A and B which are closed and satisfy AN B = @, but dist(A, B) = 0. 


a) Prove that Cantor's Intersection Theorem holds for nested compact 
sets in an arbitrary metric space; that is, if Hj, Ho, ... is a nested 
sequence of nonempty compact sets in X, then 


()\ Hk #9. 


k=1 


b) Prove that (./2, /3) MQ is closed and bounded but not compact in 
the metric space Q introduced in Example 10.5. 

c) Show that Cantor’s Intersection Theorem does not hold in an arbi- 
trary metric space if compact is replaced by closed and bounded. 


Prove that the Bolzano—Weierstrass Property does not hold for C[a, b] 
and || {|| (see Example 10.6). Namely, prove that if f,(x) = x”, then 
| fn\| is bounded but || f,, — f|| does not converge for any f € C[0, 1] 
and any subsequence {nx}. 

Let X be a metric space. 


a) Prove that if E C X is compact, then E is sequentially compact (see 
Exercise 10.1.10). 

b) Prove that if X is separable and satisfies the Bolzano—Weierstrass 
Property, then a set E C X is sequentially compact if and only if it is 
compact. 
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10.4 CONNECTED SETS 


We have introduced open sets (analogues of open intervals), closed sets (ana- 
logues of closed intervals), and compact sets (analogues of closed bounded 
intervals) in order to develop a calculus of functions of several variables, in 
Chapters 11 through 13, which parallels that developed for functions of a single 
variable in Chapters 2 through 5. Some of the earlier theory, however, depended 
on a property of intervals not yet discussed. For example, the proof of the Inter- 
mediate Value Theorem tacitly used the fact that an interval is connected (i.e., is 
unbroken and all of one piece). We shall also use connected sets in Chapter 13 to 
provide a sufficiently broad definition of surfaces for computational ease. Thus 
we introduce the following idea. 


10.44 Definition. 
Let X be a metric space. 


i) A pair of nonempty open sets U, V in X is said to separate X if and only if 
X=UUVandUNV=¥. 

ii) X is said to be connected if and only if X cannot be separated by any pair 
of open sets U,V. 


Loosely speaking, a connected space is all in one piece (i.e., cannot be broken 
into smaller, nonempty, open pieces which do not share any common points). 
Indeed, we shall see that R, under the usual metric, is connected. On the other 
hand, under the discrete metric, R is not connected (since (—oo, 0] and (0, co) 
are both “open” in the discrete space). 

Recall (Example 10.4) that every subset of X is a metric space. Hence Defini- 
tion 10.44 also defines what it means for a subset E of X to be connected. We can 
always find two subsets of an arbitrary metric space which are connected: 1) The 
empty set is connected, since it can never be written as the union of nonempty 
sets. 2) Every singleton E = {a} is also connected since, if E = U UV, where 
both U and V are nonempty, then E has at least two points. 

To obtain deeper results about connectivity, it is convenient to introduce the 
following concepts. (These concepts will also be used to study continuous func- 
tions in the next section.) 


10.45 Definition. 
Let X be a metric space and E C X. 


i) Aset U C E is said to be relatively open in E if and only if there is a set V 
open in X such thatU = ENV. 

ii) Aset A C E is said to be relatively closed in E if and only if there is a set C 
closed in X such that A= ENC. 


For example, the set E of Example 10.26 is relatively open in the subspace 
Y := {(x, y): -—1 < x < 1} and relatively closed in the subspace Z := {(x, y) : 
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—|x| < y < |x|}. Indeed, V = Z is open in R? (it contains none of its boundary), 
A = Y is closed in R? (it contains all its boundary), and E=VNY, E=AQNZ. 

Recall (Exercise 10.2.8) that a subset A of E is open (respectively, closed) in 
the subspace E if and only if it is relatively open (respectively, relatively closed) 
in the set E. Thus all Definition 10.45 does is codify the “subspace topology.” 

By Definition 10.44, then, a set E is connected if there are no nonempty sets 
U, V, relatively open in E, such that E = UUV andUNV = &. The follow- 
ing result, which is usually easier to use than Definition 10.44, shows that when 
“separating” a nonconnected set, we can use open sets instead of relatively open 
sets. (The converse of this result is also true, but harder to prove—see Theo- 
rem 10.48 below.) 


10.46 Remark. Let E C X. If there exists a pair of open sets A, B in X which 
separate E (i.e, ifE CAUB, ANB=%, ANE £G,and BN E £9), then E is 
not connected. 


Proof. Set U = ANE and V = BNE. It suffices to prove that U and V are 
relatively open in E and separate E. It is clear by hypothesis and the remarks 
above that U and V are nonempty, they are both relatively open in E, and 
UNV = 49. It remains to prove that E = U UV. But E isa subset of AU B, so 
E CUUYV. On the other hand, both U and V are subsets of E,so E DUUV. 
We conclude that E =U UV. a 


Thus when looking for “separations” of a given set E Cc X, we can confine our 
attention to open sets in X. Here are several examples. The set Q is not con- 
nected since the pair A = (—ox, J2), B = (V2, 00) separate Q. Example 10.26 
is not connected since {(x, y) : x < 0} and {(x, y) : x > 0} are open in R? (nei- 
ther of them contains any of their boundary points) and separate the bow tie set 
E. Notice that Examples 10.27 and 10.29 are both connected in R?. 

There is a simple description of all connected subsets of R. 


10.47 Theorem. A subset E of R is connected if and only if E is an interval. 


Proof. Let E be a connected subset of R. If E is empty or contains only one 
point, then £ is a degenerate interval. Hence we may suppose that EF contains 
at least two points. 

Set a = inf E and b = supE. Notice that —oo < a < b < oo. Suppose 
for simplicity that a,b ¢ E; that is, E C (a,b). If E 4 (a,b), then there is an 
x € (a, b) such that x ¢ E. By the Approximation Property, EN (a, x) #4 @ and 
EN (x, b) # @, and, by assumption, FE C (a, x) U (x, b). Hence, E is separated 
by the open sets (a, x), (x, b), a contradiction. 

Conversely, suppose that E is an interval which is not connected. Then 
there are sets U, V, relatively open in E, which separate E (1¢e., E = U UV, 
UNV = 4), and there exist points x; €¢ U and x2 € V. We may suppose that 
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x1 < x2. Since x1, x2 € E and E is an interval, Jo := [x1, x2] C E. Define f on 
Ip by 


xEeU 


0 
£G)— 1 xeV. 
Since UNV = @, f is well defined. We claim that f is continuous on Jp. 
Indeed, fix x9 € [x1], x2]. Since U UV = E 3D J), it is evident that x9 <¢ U 
or x9 € V. We may suppose the former. Let yz € Jo and suppose that y, > xo 
as k — oo. Since U is relatively open, there is an e > 0 such that (xp — €, x9 + 
e)NE CU. Since yy € E and yg > xo, it follows that yy € U for large k. 
Hence f (yp) = 0 = f (xo) for large k. Therefore, f is continuous at xo by the 
Sequential Characterization of Continuity. 

We have proved that f is continuous on Jpg. Hence, by the Intermediate 
Value Theorem (Theorem 3.29), f must take on the value 1/2 somewhere on 
Ip. This is a contradiction, since by construction, f takes on only the values 0 
or 1. | 


We can use this result to prove that a real function is continuous on a closed, 
bounded interval if and only if its graph is closed and connected (see Theo- 
rem 9.71 in the preceding chapter). 


We close this section by showing that the converse of Remark 10.46 is also true. 
This result is optional because we do not use it elsewhere. 


*10.48 Theorem. Let E C X. If there exist sets U, V, relatively open in E, such 
thattUNV =%, E=UUV, U £¥G, and V £ &, then there is a pair of open sets 
A, B which separates E. 


Proof. We first show that 
UNV =@. (8) 


Indeed, since V is relatively open in £, there is a set Q, open in X, such that 
V=ENQ. Since UNV = J, it follows that U Cc Q°. This last set is closed in 
X. Therefore, 


UCH=X; 


’ 


that is, (9) holds. 
Next, we use (9) to construct the set B. Set 


by = inf{o(x,u):ueU}, xeV, and B= |J Bs,/2(x). 


xeV 


Clearly, B is open in X. Since 6, > 0 for each x ¢ U (see Exercise 10.2.5), 
B contains V; hence, BM E D V. The reverse inequality also holds since by 
construction BU = % and by hypothesis E = U UV. Therefore, BN E = V. 
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Similarly, we can construct an open set A such that AN E = U by setting 


ey =inf{o(v,y):vEV}, yeU and A= U Be, /2(y)- 
yeU 


To prove that the pair A, B separates E, it remains to prove that AN B = @. 
Suppose to the contrary that there is a pointa € AN B. Thena € Bs, /2(x) 
for some x € V anda € Bz,/2(y) for some y € U. We may suppose that 
dy < €y. Then 


bx 


by 
p(x, y) < p(x,a)+ pla, y) < o + 7 < &y. 


Therefore, p(x, y) < inf{p(v, y) : v € V}. Since x € V, this is impossible. We 


conclude that AN B =U. a 
EXERCISES 
10.4.1. a) Leta < b andc < d be real numbers. Sketch a graph of the 
rectangle 


[a, b] x [c, d] := {(, y) : x € [a, b], y € [c, d]}, 


and decide whether this set is compact or connected. Explain 
your answers. 
b) Sketch a graph of set 


B,(—2, 0) U Bi (2,0) U {(x, 0): -1 <x < J}, 


and decide whether this set is compact or connected. Explain 
your answers. 


10.4.2. a) Sketch a graph of the set 
fons)e ye hay = 199 2 0}, 


and decide whether this set is relatively open or relatively closed 
in the subspace {(x, y) : y > 0}. Do the same for the subspace 
{(x, y) :2x* + 3y? < 12}. Explain your answers. 


b) Sketch a graph of the set 
(@y)ia? ty? <4,2° + (y— 2)" <2}, 
and decide whether this set is relatively open or relatively closed 


in the subspace B> (0, 0). Do the same for the subspace B vz (0, 2). 
Explain your answers. 


10.4.3. 


10.4.4. 


10.4.5. 


10.4.6. 


10.4.7 . 


10.4.8. 


10.4.11. 
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a) Prove that the intersection of two connected sets in R is con- 
nected. Show that this is false if R is replaced by R?. 

b) Generalize part a) as follows. If {Ey}ye,4 is an arbitrary collection 
of connected sets in R, then 


al 


acA 


is also connected. 
Prove that if E C R is connected, then E°? is also connected. Show 
that this is false if R is replaced by R?. 
Suppose that EF C X is connected and that EF C A C E. Prove that A 
is connected. 
Suppose that X and Y are metric spaces and that f : X — Y. If X is 
compact and connected, and if to every x € X there corresponds an 
open ball B, such that x € B, and f(y) = f(x) for all y € B,, prove 
that f is constant on X. 
This exercise is used in Section 10.6. Let H C X. Prove that A is 
compact if and only if every cover {Ey}vea Of H, where the E,’s are 
relatively open in H, has a finite subcover. 
A set E ina metric space is called clopen if it is both open and closed. 


a) Prove that every metric space has at least two clopen sets. 
b) Prove that a metric space is connected if and only if it contains 
exactly two clopen sets. 


Let X be a metric space. Prove that X is connected if and only if 
every nonempty proper subset of X has a nonempty boundary. 


This exercise is used to prove “Corollary 11.35. 


a) Aset E C R" is said to be polygonally connected if and only if 
any two points a,b € E can be connected by a polygonal path 
in E; that is, there exist points x, ¢ E, k = 1,..., N, such that 
Xp =a, Xy = band L(x,_|;x,) C E fork =1,..., N. Prove that 
every polygonally connected set in R” is connected. 

b) Let E C R” be open and xg € E. Let U be the set of points 
x € E which can be polygonally connected in E to xo. Prove that 
U is open. 

c) Prove that every open connected set in R” is polygonally con- 
nected. 


Suppose that {Eq}ve, is a collection of connected sets in a metric 
space X such that Nvea Eg 4 Y. Prove that 


ES |B, 


acA 


is connected. 
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10.5 LIMITS OF FUNCTIONS 


In the preceding section we used results in Chapter 2 as a model for the theory 
of limits of sequences in an arbitrary metric space X. In this section we use 
results in Chapter 3 as a model to develop a theory of limits of functions which 
take one metric space X to another Y. 

A straightforward adaptation of Definition 3.1 leads us to guess that, in an 
arbitrary metric space, f(x) — Las x — aif for every e > 0 there isad > 0 
such that 


0 < p(x,a) <6 implies t(f(x),L) <e. 


The only problem with this definition is that there may be no x which satisfies 
0 < p(x, a) < 6; for example, if X is the set N together with the metric p(x, y) = 
|x — y| and 6 = 1. To prevent our theory from collapsing into the vacuous case, 
we introduce the following idea. 


10.49 Definition. 


A point a € X is said to be a cluster point (of X) if and only if Bs(a) contains 
infinitely many points for each 6 > 0. 


For example, every point in any Euclidean space R” is a cluster point (of R”). 

Notice that any concept defined on a metric space X is automatically defined 
on all subsets of X. Indeed, since any subset F of X is itself a metric space (see 
Example 10.4 above), the definition can be applied to EF as well as to X. 

To be more specific, let E be a subspace of X (i.e., a nonempty subset of X). 
By Definition 10.7 an open ball in E has the form 


BE (a) := {x € E: p(x,a) <r}. 
Since the metrics on X and E are the same, it follows that 
BF (a) = Ba) NE, 


where B,(a) is an open ball in X. A similar statement holds for closed 
balls. We shall call these balls relative balls (in E). In particular, in the sub- 
space Q of Example 10.5 above, the relative open balls take on the form 
B-(a) = (a—r,a+r)OQ and the relative closed balls the form [a —r,a+r]NQ. 

What, then, does it mean for a set E to have a cluster point? By Defini- 
tion 10.49, a point a € X is acluster point of a nonempty set E C X if and only 
if the relative ball E M Bs(a) contains infinitely many points for each 6 > 0. 

The etymology of the term cluster point is obvious. A cluster point of E 
is a point near which E “clusters.” Cluster points are also called points of 
accumulation. 

Notice that, by definition, no finite set has cluster points. On the other hand, a 
set may have infinitely many cluster points. Indeed, by the Density of Rationals 
(Theorem 1.18), every point of R is a cluster point of Q. 

Here are two more examples of sets and their cluster points. 
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10.50 EXAMPLE. 
Show that 0 is the only cluster point of the set 


1 
B= {7 :n en}. 
n 


Solution. By Theorem 1.16 (the Archimedean Principle), given 6 > 0 there is 
an N € N such that 1/N < 6. Since n > N implies 1/n < 1/N, it follows that 
(—6, 5) N E contains infinitely many points. Thus 0 is a cluster point of E. 

On the other hand, if x9 4 0, then choose 6 < |xo|, and notice that either 
xo —6 > Oorx9 +6 < 0. Thus (xp — 6, x9 + 6) N E contains at most finitely many 
points (i.e., xo is not a cluster point of £). | 


10.51 EXAMPLE. 


Show that every point in the interval [0, 1] is a cluster point of the open interval 
(0, 1). 


Solution. Let xp € [0, 1] and 6 > 0. Then x9 +6 > Oand x9—6 < 1. In particular, 
(x9 —8, x9 +5) N(O, 1) is itself a nondegenerate interval, say (a, b). But (a, b) con- 
tains infinitely many points; for example, (a + b)/2, (Qa+b)/3, 3a+b)/4,.... 
Therefore, xo is a cluster point of (0, 1). a 


We are now prepared to define limits of functions on metric spaces. 


10.52 Definition. 


Let a be acluster point of X and f : X \{a} — Y. Then f(x) is said to converge 
to L, as x approaches a, if and only if for every ¢ > 0 there is a 6 > 0 such that 


0 < p(x,a) <6 implies t(f(x),L) <e. (9) 
In this case we write f(x) ~ Las x > a, or 


L= lim f(x), 
xa 


and call L the limit of f(x) as x approaches a. 


As in Chapter 3, we can introduce an algebraic structure on functions that 
take X to R”. Given f,g : X — R” andc é€ R, define the pointwise sum, 
pointwise product, and scalar product by 


(f+8)(x) = f@)+s@), (f8)(x) = f@)gx), and (cf)@) = cf(x), x eX. 
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When m = 1, define the pointwise quotient by 


(f/g)(%) = F(a) xeX and g(x) £0. 
g(x) 


By modifying the proofs presented in Chapter 3, we can prove the following 
results about limits of functions on metric spaces. 


10.53 Theorem. Leta be a cluster point of X and f, g : X \ {a} > Y. 


i) If f(x) = g(x) forall x € X \ {a} and f (x) has a limit as x — a, then g(x) 
also has a limit as x — a, and 


lim g(x) = lim f(x). 
xa x—7a 
ii) [SEQUENTIAL CHARACTERIZATION OF Limits]. The limit 
L:= lim f(x) 
xa 


exists if and only if f(x») > L asn — oo for every sequence xn € X \ {a} 
which converges to aas n — ov. 
iti) Suppose that Y = R". If f (x) and g(x) have a limit as x approaches a, then so 


do (f +g)(x), (f - 8)(x), (@f)(x), and (f/g)(x) [when Y = Rand the limit 
of g(x) is nonzero]. In fact, 


Jim (f + 8) (x) = lim f(x) + lim g(a), 
Jim (@f) @) = @ lim f(x), 
jim (f- 8) (x) = lim f(x) - lim g(x), 


and [when Y = Rand the limit of g(x) is nonzero] 


: (4) lim, +¢ f(@) 
lim { — | («) = ——————_.. 
g limy-+¢ g(x) 


iv) [SQUEEZE THEOREM FOR FuNCTIONS]. Suppose that Y =R. Ifh: X \ {a} > R 
Satisfies g(x) < h(x) < f(x) forallx € X \ {a}, and 


lim f(x) = lim g(x) = L, 
xa xa 
then the limit of h exists, as x — a, and 


lim h(x) = L. 
xa 
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v) [COMPARISON THEOREM FOR FuNCcTIONS]. Suppose that Y = R. If f(x) < g(x) 
for all x € X \ {a}, and if f and g have a limit as x approaches a, then 


lim f(x) < lim g(%). 
xXx7a xa 


At this point you should read Section 9.6 to see how these concepts play out 
in the concrete Euclidean space setting. Pay special attention to Theorem 9.41 
and Example 9.43 (which show how to evaluate a limit in R”) and Examples 9.44 
and 9.45 (which show how to prove that a specific limit in R” does not exist). 

Here is the metric space version of Definition 3.20. 


10.54 Definition. 
Let E be anonempty subset of X and f: E > Y. 


i) f is said to be continuous at a pointa € E if and only if given ¢ > 0 there is 
a é > Osuch that 


p(x,a) <6 and xe€E imply t(f(x), f@) <e. 


ii) f is said to be continuous on E (notation: f : E — Y is continuous) if and 
only if f is continuous at every x € E. 


Notice that this definition is valid whether a is a cluster point or not. Modify- 
ing corresponding proofs in Chapter 3, we can prove the following results. 


10.55 Theorem. Let E be anonempty subset of X and f,g:E— Y. 


i) f is continuous ata € E if and only if f(xn) > f(a), asn + o, for all 
sequences x, € E which converge to a. 

ii) Suppose that Y = R". If f, g are continuous at a point a € E (respectively, 
continuous on a set E), then so are f + g, f -g, andaf (foranya € R). 
Moreover, in the case Y = R, f/g is continuous ata € E when g(a) # 0 
[respectively, on E when g(x) £0 forall x € E}. 


The following result shows that the composition of two continuous functions 
is continuous. 


10.56 Theorem. Suppose that X, Y, and Z are metric spaces and that a is a cluster 


point of X. Suppose further that f : X > Y and g: f(X) > Z. If f(x) > Las 
x — aand gis continuous at L, then 


lim (g 0 f)(x) = g (lim f(x). 


We shall examine the metric space analogues of the Extreme Value Theorem, 
the Intermediate Value Theorem, and uniform continuity in Section 10.4. 
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EXERCISES 


10.5.1. 


10.5.2. 


10.5.3. 


10.5.4. 


10.5.5. 
10.5.6. 
10.5.7. 
10.5.8. 


10.5.9. 


Find all cluster points of each of the following sets. 


a) E=R\Q 

b) E=[a,b), a,beR,a<b 

c) E={(-1l)"n:neN} 

d) E = {x, :n € N}, where x, > x asn — co 
e) EB =41-1,3.4,0- 3.0, 23 4..-3 


a) A point a in a metric space X is said to be isolated if and only if there 
is anr > 0 so small that B,(a) = {a}. Show that a point a € X is not 
a cluster point of X if and only if a is isolated. 

b) Prove that the discrete space has no cluster points. 


Prove that a is a cluster point for some E C X if and only if there is a 
sequence x, € E \ {a} such that x, > aasn > o. 


a) Let EF be anonempty subset of X. Prove that a is a cluster point of E 
if and only if for each r > 0, EM B,(a) \ {a} is nonempty. 

b) Prove that every bounded infinite subset of R has at least one clus- 
ter point. 


Prove Theorem 10.53. 

Prove Theorem 10.55. 

Prove Theorem 10.56. 

Prove that if f, € C[a, b], then f,, — f uniformly on [a, b] if and only if 
tn > f inthe metric of C[a, b] (see Example 10.6). 

Suppose that X is a metric space which satisfies the following condition. 


10.57 Definition. 


X is said to satisfy the Bolzano—Weierstrass Property if and only if every 
bounded sequence x, € X has a convergent subsequence. 


a) Prove that if E is a closed, bounded subset of X and x, € EF, then there is an 
a € E and a subsequence x, of x, such that x,, > a ask —> oo. 

b) If E is closed and bounded in X and f : E > Ris continuous on E, prove 
that f is bounded on E. 

c) Prove that under the hypotheses of part b) that there exist points x», 
xu € E such that 


f (xm) = up f(x) and fm) = any Ff (x). 


10.6 CONTINUOUS FUNCTIONS 


In this 


section we discuss the behavior of images and inverse images of open 


sets, closed sets, compact sets, and connected sets under continuous functions. 
We shall use these results many times in the sequel. 
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Recall that if X and Y are metric spaces (with respective metrics p and Tr), then 
a function f : X — Y is continuous on X if and only if given a € X ande > 0 
there isa 6 > Osuch that p(x, a) < 6 implies t( f(x), f(a@)) < ¢; that is, such that 


Bs(a) C f~'(Be(f(@)). (10) 


This observation can be used to give the following simple but powerful charac- 
terization of continuous functions which can be stated without using the metric 
of X (see also Exercise 10.6.3). 


10.58 Theorem. Suppose that f : X — Y. Then f is continuous if and only if 
f—!(V) is open in X for every open V in Y. 


Proof. Suppose that f is continuous on X and that V is open in Y. We may 
suppose that f 1 (V) is nonempty. Let a € f—'(V); that is, f(a) € V. Since 
V is open, choose ¢ > 0 such that B,(f(a)) C V. Since f is continuous at a, 
choose 6 > 0 such that (10) holds. Evidently, 


Bs(a) C f7'(Be(f(a))) © f-'(V). (11) 


Since a € f~!(V) was arbitrary, we have shown that every point in f -l(V) is 
interior to f~!(V). Thus f~!(V) is open. 

Conversely, let e > 0 anda € X. The ball V = B,(f(a)) is open in Y. 
By hypothesis, f~!(V) is open. Since a € f~!(V), it follows that there is 
a dé > O such that Bs(a) C f~!(V). This means that if p(x,a) < 6, then 
t(f(x), f(a)) < e. Therefore, f is continuous ata € X. 5 


By using the subspace (i.e., relative) topology, we see that Theorem 10.58 
contains the following criterion for f to be continuous on a subset of X. 


10.59 Corollary. Let E © X and f : E > Y. Then f is continuous on E if and 
only if f~'(V) 0 E is relatively open in E for all open sets V in Y. 


We shall refer to Theorem 10.58 and its corollary by saying that open sets 
are invariant under inverse images by continuous functions. It is interesting to 
notice that closed sets are also invariant under inverse images by continuous 
functions (see Exercises 10.6.3 and 10.6.4). 

It is natural to ask whether compact sets and connected sets are invariant 
under inverse images by continuous functions. The following examples show 
that, even for the metric space R, the answer to this question is no. 


10.60 EXAMPLES. 


i) If f(x) = 1/x and A = [0,1], then f is continuous on (0, oo) and H is 
compact, but f~!(H) = [1, 00) is not compact. 

ii) If f(x) = x* and E = (1,4), then f is continuous on R and E is connected, 
but f—!(£) = (—2, —1) U (1, 2) is not connected. 
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The next two results show that compact sets and connected sets are invariant 
under images, rather than inverse images, by continuous functions. 


10.61 Theorem. Jf H is compact in X and f : H — Y is continuous on H, then 
f(A) is compact in Y. 


Proof. Suppose that {Vy}ve4 is an open covering of f(H). By Theorem 1.37, 


refs gaye; (U v) = |J f7'(ve). 


aecA acA 


Hence, by Corollary 10.59, {f~!(Va)}wvea is a covering of H whose sets are 
all relatively open in H. Since H is compact, there are indices a1, a2,...,aN 
such that 


N 
HE |J fa) 


j=l 
(see Exercise 10.4.7). It follows from Theorem 1.37 that 


N N N 
fet | JF Op. | a UG ef Cay =| ve 
j=l j=l 


j=l 
Therefore, f(H) is compact. a 


10.62 Theorem. [f FE is connected in X and f : E — Y is continuous on E, then 
f (E) is connected in Y. 


Proof. Suppose that f(£) is not connected. By Definition 10.44, there exists 
a pair U,V c Y of relatively open sets in f(E) which separates f(E). By 
Exercise 10.6.4, f~'(U) N E and f~!'(V) NE are relatively open in E. Since 
f(E) =U UV, we have 


E=(f 1U)N EUW) NE). 


Since UN V = @, we also have 7 OW) ‘a FV) = %. Thus f 1M) NE, 
f—'(V)  E is a pair of relatively open sets which separates E. Hence, by 
Definition 10.44, E is not connected, a contradiction. a 


(Note: Theorems 10.61 and 10.62 do not hold if compact or connected are re- 
placed by open or closed. For example, if f(x) = x* and V = (—1, 1), then f is 
continuous on R and V is open, but f(V) = [0, 1) is neither open nor closed.) 
Suppose that f is a real function continuous on a closed, bounded interval 
[a, b]. Then the function F(x) = (x, f(x)) is continuous from R into R’. Since 
the graph of y = f(x) for x € [a, b] is the image of [a, b] under F, it follows 
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from Theorems 10.61 and 10.62 that the graph of f is compact and connected. 
It is interesting to note that this property actually characterizes continuity of 
real functions (see Theorem 9.71 in the preceding chapter). 

To illustrate the power of the topological point of view presented above, com- 
pare the proofs of the following theorem and Exercise 10.6.5 with those of The- 
orems 3.26 and 3.29. 


10.63 Theorem. [EXTREME VALUE THEOREM]. 
Let H be a nonempty, compact subset of X and suppose that f : H > Ris 
continuous. Then 


M :=sup{f(x):x€ H} and m:=inf{f(x):x € A} 


are finite real numbers and there exist points xu, Xm € H such that M = f (xm) 
and m = f (Xm). 


Proof. By symmetry, it suffices to prove the result for M. Since H is compact, 
f (A) is compact. Hence, by the Theorem 10.37. f(H) is closed and bounded. 
Since f(H) is bounded, M is finite. By the Approximation Property, choose 
xz € H such that f(x.) > Mask — oo. Since f(A) is closed, M € f(A). 
Therefore, there is an xy € H such that M = f(xy). A similar argument 
shows that m is finite and attained on H. =| 


The following analogue of Theorem 4.32 will be used in Chapter 13 to exam- 
ine change of parametrizations of curves and surfaces. 


10.64 Theorem. I/f H is a compact subset of X and f : H — Y is 1-1 and 
continuous, then f—' is continuous on f (H). 


Proof. By Exercise 10.6.4a, it suffices to show that (f —!)—! takes closed sets 
in X to relatively closed sets in f(H). Let E be closed in X. Then EN AH isa 
closed subset of H, so by Remark 10.36, E N H is compact. Hence, by Theo- 
rem 10.61, f(EM HA) is compact, in particular, closed. Since f is 1-1, f(EN A) 
= f(E)N f(A) (see Exercise 1.5.7). Since f(E M H) and f(#) are closed, it 
follows that f(Z)N f(A) is relatively closed in f(H). Since (f-')-! = f, we 
conclude that (f~!)~!(E) N f(A) is relatively closed in f(#). a 


If you are interested in how to use these topological ideas to study real functions 
further, you may read Section 9.8 in the preceding chapter now. 
EXERCISES 
10.6.1. Let f (x) = cosx and g (x) = |x| /x ifx #0 and g (0) = 0. 


a) Find f (£) and g(£) for E = (0,27), E = [0,27], E = (0, 1), and 
E = [0,1]. Compare your answers to what Theorems 10.58, 10.61, 
and 10.62 predict. Explain any departures from the predictions. 
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10.6.2. 


10.6.3. 


10.6.4. 


10.6.5. 


10.6.6. 


Metric Spaces 


b) Find f~! (£) and g~! (E) for E = (0, 1), E = [0, 1], E = (—1, 1), and 
E = [1,1]. Compare your answers to what Theorems 10.58, 10.61, 
and 10.62 predict. Explain any departures from the predictions. 


Let f(x) = /x and g(x) = 1/x if x 40 and g(0) = 0. 


a) Find f(£) and g(£) for E = (0, 1), E = [0, 1), and E = [0, 1]. Com- 
pare your answers to what Theorems 10.58, 10.61, and 10.62 predict. 
Explain any departures from the predictions. 

b) Find f~!(£) and g~!(E£) for E = (—1,1) and E = [-1, 1]. Com- 
pare your answers to what Theorems 10.58, 10.61, and 10.62 predict. 
Explain any departures from the predictions. 


Suppose that f : X — Y. Prove that f is continuous if and only if 
f—\(C) is closed in X for every set C closed in Y. 
Suppose that E C X and that f: E > Y. 


a) Prove that f is continuous on E if and only if f~'(A) E is relatively 
closed in EF for all closed sets A in Y. 

b) Suppose that f is continuous on £. Prove that if V is relatively open 
in f(E), then f—!(V) is relatively open in E£, and if A is relatively 
closed in f(E), then f~!(A) is relatively closed in E. 


[INTERMEDIATE VALUE THEOREM]. Let E be a connected subset of X. If 
f : E — Ris continuous, f(a) 4 f(b) for some a,b € E, and yisa 
number which lies between f(a) and f(b), then prove that there is an 
x € E such that f(x) = y. (You may use Theorem 10.47.) 

Suppose that H is a nonempty compact subset of X and that Y is a Eu- 
clidean space. 


a) If f : H > Y is continuous, prove that 


If lla := sup || f@)lly 
xeH 


is finite and there exists an x9 € H such that || f (xo) |ly = | f lla. 

b) A sequence of functions f; : H — Y is said to converge uniformly 
on H toa function f : H — Y if and only if given e > 0 there is an 
N € Nsuch that 


k>N and xe€H imply | f(x) -—f@)lly <e. 


Show that || f; — fll4 ~ Oask > ooif and only if f, > f uniformly 
on Hask > ~. 

c) Prove that a sequence of functions f; converges uniformly on A if 
and only if, given e > 0, there is an N € N such that 


k,j=N implies ||fe — fjlla <e. 
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10.6.7. Suppose that E is a compact subset of X. 


a) If f,g : E — R" are uniformly continuous, prove that f + g and 
f -g are uniformly continuous. Did you need compactness for both 
results? 

b) If g : E > Ris continuous on E and g(x) # 0 for x € E, prove that 
1/g is a bounded function. 

c) If f,g : E — Rare uniformly continuous on EF and g(x) 4 0 for 
x € E, prove that f/g is uniformly continuous on E. 

10.6.8. Suppose that E C X and that f: E> Y. 


a) If f is uniformly continuous on E and x, € E is Cauchy in X, prove 
that f(x) is Cauchy in Y. 

b) Suppose that D is a dense subspace of X (i.e., that D C X and D = X). 
If Y is complete and f : D — Y is uniformly continuous on D, prove 
that f has a continuous extension to X; that is, prove that there is a 
continuous function g : X — Y such that g(x) = f(x) for all x € D. 


10.6.9. Suppose that X is connected. Prove that if there is a nonconstant, con- 
tinuous function f : X — R, then X has uncountably many points. 


*10.7 STONE-WEIERSTRASS THEOREM 


This section uses the Binomial Series (Theorem 7.52). Since these results are not 
used elsewhere, it can be skipped. 


One of the oldest questions in analysis is the following: 


APPROXIMATION QUESTION. Can one use polynomials to approximate 
continuous functions on an interval [a, b]? 


For well over 150 years, mathematicians thought that the way to answer 
this question was to use Taylor polynomials. When Cauchy showed that this 
approach was doomed to failure, even for the smaller space C™[a, b] (see 
Remark 7.41), other avenues of investigation were opened up. The main result 
of this section, the Stone—Weierstrass Theorem, shows that the answer to the 
Approximation Question is yes in a very general sense. You will explore some 
of the consequences of this powerful result in the exercises. It has many other 
far-reaching consequences as well and is valid in a much more general setting 
than compact metric spaces. 

For each metric space X, C(X) will represent the collection of continu- 
ous functions from X to R. The Stone—Weierstrass approach to approximation 
requires that the collection of approximating functions be closed under point- 
wise addition, pointwise multiplication, and scalar multiplication. Thus we begin 
with the following concept. 


10.65 Definition. 
A set A is said to be a (real function) algebra in C(X) if and only if 


i) DAA CC(X). 
ii) If f,g ¢ A, then f + g and fg both belong to A. 
iti) If f ¢e Aandc ER, thencf € A. 
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We notice once and for all that every algebra of functions contains the zero 
function, since f € A implies —f € A;henceO= f—f eA. 

It is well known that the collection of one variable polynomials is an algebra 
in C(R). By Theorem 10.53, C(X) itself is an algebra in C(X) for any metric 
space X. 

As you know, there are several types of convergence for function sequences. 
The Stone—Weierstrass Theorem is a result about the strongest of these: uniform 
convergence. To streamline our presentation, we introduce a metric on C(X) 
which governs uniform convergence. Namely, we define dist(f, g) := || f — gll, 
where uniform norm of an f € C(X) is defined to be 


II fll = sup | f(@)I. 
xEx 


If X is compact, then dist(f, g) is a metric on C(X). Indeed, by the Extreme 
Value Theorem, dist(f, g) is finite for each f, g € C(X). And since the abso- 
lute value metric on R is positive definite, symmetric, and satisfies the Triangle 
Inequality, it is easy to verify that dist(f, g) does, too. 

Notice that || f; — f|| ~ 0asn — oo if and only if for each e > 0 there is 
an N € N such that n > N implies |f,(x) — f(x)| < e for allx € X. Thus 
convergence of a sequence f, in the metric of C(X) is equivalent to uniform 
convergence of f, on X. 


10.66 Definition. 


Let X be a metric space. 


a) A subset A of C(X) is said to be (uniformly) closed if and only if for each 
sequence f, € A that satisfies || f, — f|| > 0asn — oo, the limit function 
f belongs to A. 

b) A subset A of C(X) is said to be uniformly dense in C(X) if and only if given 
é > Oand f € C(X) there is function g € A such that ||g — f|| < «. 


It is easy to see that C(X) is uniformly closed (modify the proof of Theo- 
rem 7.9). If we can show that the polynomials are uniformly dense in C[a, b], 
then the answer to the Approximation Question is evidently yes. 

It turns out that uniformly closed algebras which contain the constant func- 
tions are also closed under pointwise maxima and minima. 


10.67 Lemma. 

Suppose that X is a compact metric space and that A is a closed algebra in C(X) 
which contains the constant functions. For each f, g € C(X) and each x € X, 
define (f A g)(x) := min{ f(x), g(x)} and (f V g)(x) := max{ f(x), g(a)}. If 
ft, g € A, then f \ g and f V g both belong to A. 


Proof. Since f A g and f Vv g can be defined as an algebraic combination of 
(f +g) and | f — g| (see Exercise 3.1.8), and A is an algebra, it suffices to prove 
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that f € A implies | f| € A. If || f|| = 0, then | f| = 0 € A and there is nothing 


to prove. Hence, we may suppose that M := || f|| > 0. 
Let e > 0. Recall (see Theorem 7.52) that the Binomial Series 


jt] = —( —2?))!? 


_ 1 2 i 2\2 
=] a t“) 74h t“) (12) 
1-3-5 eee 2yk 
7 2-4-6---(2k) 


converges uniformly on compact subsets of (—V2, /2); hence on [—1, 1]. In 
particular, there is an N € N such that ifn > N and 


Al és 1 no W139 -5 +++ (2k —3) oe 
P,(t) = 1-5-1") - 571 -#*) 2 a RENTS (1—17)*, (13) 


then |P,,(t) — |t|| < e for all t € [—1, 1]. 

Fix x € X and set g,(x) := P,(f(x)/M). Since A is an algebra which 
contains the constant functions, g, € A. Since M = ||f|| > 0 implies that 
t := f(x)/M é [-1, 1], we have by the choice of P that 


en(x -| 2 | = |Pn(t) — |t|| <e 


for alln > N and all x € X. Thus Mg, € A and Mg, — |f| uniformly on X, 
as n — oo. Since A is uniformly closed, we conclude that | f| belongs to A. Hi 


We will identify conditions on an algebra A which guarantee that A is uni- 
formly dense in C(X). Since the collection of constant functions forms an alge- 
bra in C[a, b] which is not uniformly dense in C[a, b], such an A must contain 
some nonconstant functions. The following concept addresses this issue. 


10.68 Definition. 


A subset A of C(X) separates points of X if and only if given x, y € X with 
x # y there exists an f € A such that f(x) 4 f(y). 


We are now prepared to identify a huge class of dense algebras in C(X) when 
X is compact. 


10.69 Stone—Weierstrass Theorem. Suppose that X is a compact metric space. 
If A is an algebra in C(X) that separates points of X and contains the constant 
functions, then A is uniformly dense in C(X). 
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Proof. Fixx, y € X with x 4 y. Leta, b be any points in R. Since A separates 
points, there is a g € A such that g(x) # g(y). Since A is an algebra that 
contains the constants—in particular, contains g(x) and g(y)—the function 


gt)—g(y) | , gf) — 8@) 
g(x)—g(y) a (y) — 8) 


f(t) :=a 


also belongs to A. Since f(x) = a and f(y) = b, we have proved the follow- 
ing: Given x, y € X, 


ifx # yanda,b ER, then f(x) =a and f(y) =b forsome f € A. (14) 


Let B represent the uniform closure of A (i.e., B is the set of all functions f 
for which there exists a sequence f,, € A such that || f, — f|| ~ 0asn > oo). 
Since A is an algebra, so is B. Since B is by definition uniformly closed, it 
contains its pointwise maxima and minima (see Lemma 10.67). 

Let F € C(X) and e > 0. We must show that there is a G € B such that 


F(x) —e€ < G(x) < F(x) +e (15) 


for all x € X. We will do this a point at a time and use the compactness of X 
to globalize these local estimates. 
Fix x9 € X. For each y # xo, use (14) to choose an f, € A C B such that 


fyQo) = Fo) and f(y) = FQ). 


Since f, and F are continuous, the set V, := {x € X : fy(x) < F(x) +é} 1s 
open. Since xo, y € Vy for all y € X, it is clear that 


x= |) v,. 


yAXO 


Since X is compact, there exist y; € X such that X = Uh Vy, Let fj = fy, 
for j = 1,..., N and set gy, := fi A--- A fy. Then gy, € B. Moreover, by 
construction, 


8x9 (X0) = F(x0) A+++ A F(x0) = Fo). 


Since x € X implies x € V\, for some k € [1, N], we also have 
Ban(X) < felx) < FQ) +e. (16) 


This is essentially the right half of (15). 
To finish the proof, repeat the argument by covering X by a finite collection 
of open sets {W,, ae where 


Wy, = (x € Xi gy (x) > F(x) — €}. 
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Let gj = gx, and set G := gi V--- Vv gm. By (16), G(x) < F(x) + € for all 
x € X. But since x € X implies that x € Wy; for some j, we also have 


G(x) >= gj(x) = F(x) —€ 


for all x € X. Therefore, (15) holds for all x € X. a 
EXERCISES 
10.7.1. a) Prove that given f € C[a, b], then there is a sequence of one-variable 


10.7.2. 


10.7.3. 


10.7.4. 


10.7.5. 


10.7.6. 


polynomials P, such that P, > f uniformly on [a, b] asn > oo. 
b) Prove that the metric space C[a, b] (see Example 10.6) is separable. 


A polynomial on R" is a function of the form 


MN Nn 
P = . . ji Jn 
(X41, X2,...5Xn) . a aj, pees in*] ran 


ji=0 Jn=0 


where the aj,,..., jn 8 are scalars and Nj,...,Nn € N. Prove that if A is 
compact in R” and f € C(A), then there is a sequence of polynomials P, 
on R” such that P, — f uniformly on A as k > oo. 

Let R = [a, b] x [c, d] be a rectangle in R’. A function f is said to have 
separated variables if 


N 
P(x, y) = Do ck fel) ge(y) 


k=1 


for some scalars cx, and functions fx € C[a, b], gx € C[c,d]. Prove that 
given f € C(R) there is a sequence of functions with separated variables, 
P,, such that P, — f uniformly on R asn > oo. 

Use Exercise 10.7.1 to prove that if f € C[a, b] and 


b 
i f(x) x* dx =0 


fork =0,1,..., then f(x) = 0 for all x € [a, b]. 
Use Exercise 10.7.3 to prove that if f € C([a, b] x [c, d]), then 


i (ii) dx -{° (f° t.»ax) dy. 


Let T = [0, 27). 


a) Prove that the function 
®(x) := (cos x, sinx) 


is 1-1 from T onto dB,(0,0) C R’. 
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b) 


Cc 


wa 


d) 


10.7.7. 


Metric Spaces 


Prove that 
P(x, y) := ||P) — &GY)II 


is ametric on T. 

Prove that a function f is continuous on (T, p) if and only if it is 
continuous and periodic on [0, 27]; that is, if and only if f has an 
extension to [0, 277] which is continuous in the usual sense which also 
satisfies f(0) = f(2z). 


A function P is called a trigonometric polynomial if 
N 
P(x) = ax cos(kx) + by sin(kx) 
k=0 


for some scalars a;, by. Prove that given f € C(T) there is a sequence 
of trigonometric polynomials P, such that P, — f uniformly on 
[O, 27] asn > oo. 

Use Exercise 10.7.6 to prove that if f is continuous and periodic on 
[0, 27] and 


20 20 
f(x) cos(kx) dx = f(x) sin(kx) dx =0 
0 


fork =0,1,..., then f(x) =0 for all x ¢€ [a, b]. 


CHAPTER 11 


Differentiability on R” 


11.1 


PARTIAL DERIVATIVES AND PARTIAL INTEGRALS 


The most natural way to define derivatives and integrals of functions of several 
variables is to allow one variable to move at a time. The corresponding objects, 
partial derivatives and partial integrals, are the subjects of this section. Our main 
goal is to identify conditions under which partial derivatives, partial integrals, 
and evaluation of limits commute with each other (e.g., under which the limit of 
a partial integral is the partial integral of the limit). 

We begin with some notation. The Cartesian product of a finite collection of 
sets E,, E2,..., E, is the set of ordered n-tuples defined by 


E, x Ey x--+x Ey :={(41, X2,-..,Xn) 1X] €.6; for j= 1,2,4. 25m}. 


Thus the Cartesian product of n subsets of R is a subset of R”. By a rectangle 
in R” (or an n-dimensional rectangle) we mean a Cartesian product of n closed, 
nondegenerate, bounded intervals. An n-dimensional rectangle H = [a,, bj] 
x +++ X [dy, bp] is called an n-dimensional cube with side s if |b; — aj| = s for 
jJ=l,...,n. 

Let f : {x1} x +--+ x {xj—-1} x [a, b] x {xj41} x +++ x {xn} > R. We shall denote 
the function 


BOSS f Cie c eps te A sis t € [a, Db}, 


by f(x1,...,Xj-1,°,Xj41,---,%n). If g is integrable on [a, b], then the partial 
integral of f on [a, b] with respect to x; is defined by 


b b 
| teres | g(t) dt. 


If g is differentiable at some to € (a,b), then the partial derivative (or first- 


order partial derivative) of f at (x1,...,%j-1,t0,Xj41,---Xn) with respect to x; 
is defined by 

af 

Sisto dey eek an) ee UG) 

OX; 
We will also denote this partial derivative by Fag Minions Ri alg liepplo teen an), 


Thus the partial derivative f,, exists at a point a if and only if the limit 


397 
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Ay i nei) =) 


OX; h->0 


exists. (Some authors use f; to denote the partial derivative f,;. To avoid con- 
fusing first-order partial derivatives with sequences and components of func- 
tions, we will not use this notation.) 

We extend partial derivatives to vector-valued functions in the following way. 
Suppose that a = (qj,...,dn) € R” andf = (fi, fo,..., fin) : {ar} x «++ x 
{aj—-1} x I x {aj4i} x +++ x {an} > R™”, where j € {1,2,...,} is fixed and / is 
an open interval containing a;. If for each k = 1, 2,...,m the first-order partial 
derivative df; /dx; exists at a, then we define the first-order partial derivative of 
f with respect to x; to be the vector-valued function 


_ fy (fig. Btn 
fy (a) := re (a) := (= (a),..., ae; «@)) 6 


Higher-order partial derivatives are defined by iteration. For example, the 
second-order partial derivative of f with respect to x; and x; is defined by 


5 of 8 (at 
aa Fo axj Ax \ Ox; 


when it exists. Second-order partial derivatives are called mixed when j # k. 
This brings us to the following important collection of functions. 


11.1 Definition. 
Let V be a nonempty, open subset of R”, let f: V > R”, and let p EN. 


i) f is said to be C? on V if and only if each partial derivative of f of order 
k < p exists and is continuous on V. 
ii) fis said to be C™ on V if and only if fis C? on V for all p EN. 


Clearly, if fis C? on V and gq < p, then fis C7 on V. By making obvious 
modifications in Definition 11.1 using Definition 4.6, we can also define what it 
means for a function to be C? on a rectangle H. We shall denote the collection 
of functions which are C? on an open set V, respectively, on a rectangle H, by 
C?(V), respectively, by C?(H). 

For simplicity, we shall state all results in this section for the case n = 2 and 
m = 1, using x for x; and y for x2. (It is too cumbersome to do otherwise.) It is 
clear that, with appropriate changes in notation, these results also hold for any 
n,m EN. 

Since partial derivatives and partial integrals are essentially one-dimensional 
concepts, each one-dimensional result about derivatives and integrals contains 


Section 11.1 Partial Derivatives and Partial Integrals 399 


information about partial derivatives and partial integrals. Here are three 
examples. 


1) By the Product Rule (Theorem 4.10), if f, and g, exist, then 


of 


a (Fa) =f 58 + oe 
x! 


2) By the Mean Value Theorem (Theorem 4.15), if f(-, y) is continuous on [a, b] 
and the partial derivative f,(-, y) exists on (a, b), then there is a point c € 
(a, b) (which may depend on y as well as a and b) such that 


f(b, y)—f@y) =(b—a) Ete, 3) 


3) By the Fundamental Theorem of Calculus (Theorem 5.28), if f(-, y) is con- 
tinuous on [a, b], then 


0 Xx. 
ax i f@, y)dt = f(x,y), 
X Ja 


and if the partial derivative f,.(-, y) exists and is integrable on [a, b], then 


b9 
- Ho ea tE Faw, 


Our first result about the commutation of partial derivatives, partial integrals, 
and evaluation of limits deals with interchanging two first-order partial deriva- 
tives (see also Exercise 11.2.11). 


11.2 Theorem. Suppose that V is open in R’, that (a, b) € V, and that f:V +R. 
If f is C! on V, and if one of the mixed second partial derivatives of f exists on V 
and is continuous at the point (a, b), then the other mixed second partial derivative 
exists at (a, b) and 


af af 
b= , b). 
ay ax" ) ax ay ) 


NOTE: These hypotheses are met if f € C*(V). 


Proof. Suppose that f,, exists on V and is continuous at the point (a, d). 
Consider A(h,k) := f(ath,b+k)—-— fiath,b) — fla,b+k) + fla, b), 
defined for |A|, |k| < r//2, where r > 0 is so small that B,(a,b) C V. Apply 
the Mean Value Theorem twice to choose scalars s, t € (0, 1) such that 


q2 
(hb) = RE (a th, b+ tk) — ke (a,b + th) = hk r 


a b-+tk). 
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Since this last mixed partial derivative is continuous at the point (a,b), 
we have 


ree SU eae 
lim lim = 
k>0h>0 hk Ox Oy 


(a, b). (1) 


On the other hand, the Mean Value Theorem also implies that there is a scalar 
u € (0, 1) such that 


A(h,k) = flath,b+k)— f(a,b +k) — f(ath,b) + f(a, b) 
af af 


=h—(atuh,b+k)—h—(a+uh, b). 
ox Ox 


Hence, it follows from (1) that 


k>0N>0k 0 


ek ie INGE) 20°F 
= lim lim = 
k>0h>0 hk ax dy 


1 /d ) 
lim lim — (Feta +uh,b+k)— Fag + uh, ») 
x x 


(a, b). 


Since f; is continuous on B,(a, b), we can let h = 0 in the first expression. We 
conclude by definition that 


a f 
dy Ox 


oie OT, of ef 
(a, b) =e (Haren — Hi (a.b)) = BE ay 


(a, b). a 

We shall refer to the conclusion of Theorem 11.2 by saying the first partial 
derivatives of f commute. Thus, if f is C? on an open subset V of R”, ifae V, 
andif j #k, then 


af anf 


a)= 
OX j OX, OX OX j 


(a). 


The following example shows that Theorem 11.2 is false if the assumption 
about continuity of the second-order partial derivative is dropped. 


11.3 EXAMPLE. 


Prove that 
x22 


y (x,y) =0 


is C! on R’, both mixed second partial derivatives of f exist on R?, but the first 
partial derivatives of f do not commute at (0, 0); that is, fy, (0,0) 4 f)x (0, 0). 
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Proof. By the one-dimensional Product and Quotient Rules, 
af a (x? — y? 0 x2 y? 
ae oe ete ae (= + e (xy) ae 

Axy? + Roe > 
=e oo ——— 
"Lar sy) 7 Gry 


for (x, y) # (0, 0). Since 2|xy| < x7+ y?, we have | f; (x, y)| < 2|y|. Therefore, 
fc, y) > Oas (x, y) > (0, 0). On the other hand, by definition 


for all y € R; hence, f, (0, 0) = 0. This proves that f, exists and is continuous 
on R? with value —y at (0, y). A similar argument shows that fy exists and 
is continuous on R? with value x at (x, 0). It follows that the mixed second 
partial derivatives of f exist on R’, and 


0° f a f 
0,0) =-14A1= 
By ax’ ) a Ox Oy 


(0, 0). | 


The following result shows that we can interchange a limit sign and a partial 
integral sign when the integrand is continuous on a rectangle. 


11.4 Theorem. Let H = [a,b] x [c,d] be a rectangle and let f : H > R be 
continuous. If 


b 
ry) = | f(x, y)dx, 


then F is continuous on [c, d]; that is, 


b b 
lim f(x, y)dx = / lim f(x, y)dx 
yoyo Ja a Yoyo 
yelc,d] yele,d] 


for all yo € [c, d]. 


Proof. For each y ¢€ [c,d], f(-, y) is continuous on [a, b]. Hence, by Theo- 
rem 5.10, F(y) exists for y € [c, d]. 

Fix yo € [c, d] and let e > 0. Since H is compact, f is uniformly continuous 
on H. Hence, choose 6 > 0 such that ||(x, y) — (z, w)|| < 6 and (x, y), (z, w) 
€ H imply 


If (x, y) — f(z, w)| < = 
—a 
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Since |y — yo| = ||(x, y) — (x, yo)|l, it follows that 


b 
IF (y) — F(yo) | < | If (x,y) — f(x, yo)|dx < 


for all y € [c, d] which satisfy |y — yo| < 6. We conclude that F is continuous 
on [c, d]. | 


The following result shows that we can interchange a derivative and an inte- 
gral sign when the first partial derivative of the integrand is sufficiently smooth. 
We will refer to this process as differentiating under the integral sign. 


11.5 Theorem. Let H = [a,b] x [c,d] be a rectangle in R* and let f : H > R. 
Suppose that f (-, y) is integrable on [a, b] for each y € [c, d] and that the partial 
derivative fy(x, -) exists on [c, d] for each x € [a, b]. If the two-variable function 
fy (x, y) is continuous on H, then 


a, Flax, yax= fo se yds 


NOTE: These hypotheses are met if f € C'(H). 


Proof. Recall that “ f, (x, -) exists on [c, d]” means f(x, -) exists on (c, d), and 


f@,ce+h) -—f(,c) a 3 t(x,d +h) —f (x, d) 
h>0+ h 5 ee a h 


forall y € [c,d]. 


fy, = 


both exist (see Definition 4.6). Hence, it suffices to show that 


kim > f(x,y th) — fx, Y» 4 ref ho. ae 


h->0+ h 


for y € [c, d), and that 


es > f(x,y +h) — f, Ya ref eae 


h>0- h 


for y € (c,d]. The arguments are similar; we provide the details only for the 
first identity. 

Fix x € [a,b] and y € [c, d), and let h > 0 be so small that y +h € [c, d). 
Let ¢ > 0. By uniform continuity, choose a 6 > 0 so small that |y — z| < 6 and 
x € [a, b] imply | fy, y) — fy(x, z)| < e/(b—a). By the Mean Value Theorem, 
choose a point z(x; 4) between y and y +A such that 


f(x, yt+h)— f(@, y) =o 


F(x, y,h):= h ie z(x;h)). 
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Since |z(x; h) — y| = z(x;h) — y <h, it follows that if 0 < h < 6, then 
b 
=a E 
a 


b b9 a a 
/ F(x, y,h) dx -{ (as yds (ants h)) — ay y)| dx <e. 
b b 
al fe. ax = f Ne a 
dy Ja a Oy 


Therefore, 


Thus if H = [a,, bj] x --- x [ay, by] is an n-dimensional rectangle, if f is C! on 
H,andifk # j, then 


a. i bj 9 
2B fGnsadday=of ee Ti (2) 


OXK aj OX, 


The rest of this section contains optional material which shows what happens 
to the results above when the improper integral is used. 


We begin by borrowing a concept from the theory of infinite series. 


*11.6 Definition. 
Let a < b be extended real numbers, let J be an interval in R, and suppose 


that f : (a,b) x I > R. The improper integral 


b 
/ f(x, y) dx 


is said to converge uniformly on I if and only if f(-, y) is improperly integrable 
on (a, b) for each y € J and given ¢ > 0 there exist real numbers A, B € (a, b) 
such that 


<€é 


b B 
/ fee, ax — | f(x, y)dx 


forala<a<A, B<6B<b,andallyel. 


For most applications, the following simple test for uniform convergence of 
an improper integral will be used instead of Definition 11.6 (compare with The- 
orem 7.15). 


*11.7 Theorem. [WEIERSTRASS M-TEST]. 

Suppose that a < b are extended real numbers, that I is an interval in R, that 
f : (a,b) x I > R, and that f (., y) is locally integrable on the interval (a, b) 
for each y € I. If there is a function M : (a,b) > R, absolutely integrable on 
(a, b), such that 


If@, y)| <= M(x) 
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forall x € (a,b) and y € I, then 
b 
/ f(a, y)dx 
a 


converges uniformly on I. 


Proof. Let ¢ > 0. By hypothesis and the Comparison Test for improper inte- 


grals, i} i f(x, y) dx exists and is finite for each y € J. Moreover, since M(x) 
is improperly integrable on (a, b), there exist real numbers A, B such that 
a<A<B<band 


A b 
/ moyax+ [ M(x) dx <e. 
a B 


Thus for eacha <a < A < B < B < bandeach y € J, we have 


a b 
< | fee lac +f Ife, yl dx 


b B 
i fee. ax — | f(, y)dx 


A b 
< | Moydx+ [ M(x)dx <é. P 
a B 
The following is an improper integral analogue of Theorem 11.4. 


*11.8 Theorem. Suppose that a < b are extended real numbers, that c < d are 
finite real numbers, and that f : (a, b) x [c,d] > Ris continuous. If 


b 
ry) = | f(x, y) dx 


converges uniformly on [c, d], then F is continuous on [c, d]; that is, 
b b 
lim S(x,y) dx =i) lim f(x, y)dx 
y>y0 Ja a Y>Yo 
yelc,d] yelc,d] 
for all yo € [c, d]. 


Proof. Let « > 0 and yo € [c,d]. Choose real numbers A, B such that 
a<A<B< band 


€ 
<s- 
3 


B 
Fo) -{ f(x, y) dx 
A 
for all y € [c, d]. By Theorem 11.4, choose 6 > 0 such that 


€ 
<s- 
3 


B 
if fw ods 
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for all y € [c, d] which satisfy |y — yo| < 6. Then 


B B 
IF(y) — F(yo)| < Fo) ff f(x, y) dx} + i. (f(x, y) — fF, yo)) dx 


B 
=F Foo -| f(%, yo) dx 

A 

ee at : a a 
Came aa ae 
for all y € [c, d] which satisfy |y — yo| < 6. | 
The proof of Theorem 11.5 can be modified to prove the following result. 

*11.9 Theorem. Suppose that a < b are extended real numbers, that c < d 


are finite real numbers, that f : (a,b) x [c,d] — R is continuous, and that the 
improper integral 


b 
Fo) = | f(x, y)dx 


exists for all y € [c, d]. If fy(x, y) exists and is continuous on (a, b) x [c, d] and if 


a) 
$(y) = wx, ») dx 
a y 


converges uniformly on [c, d], then F is differentiable on [c, d] and F'(y) = $(y); 


that is, 
af? an) 
=| fs. yydx = f OF Gc ae 
dy Ja a Oy 


forall y € [c, d]. 


For a result about interchanging two partial integrals, see Theorem 12.31 and 
Exercise 12.3.10. 


EXERCISES 


11.1.1. Compute all mixed second-order partial derivatives of each of the 
following functions and verify that the mixed partial derivatives 
are equal. 


‘ x= 2 

a) fy) =ye* db) fy) =sin@y) 0) fy) = 552 

11.1.2. For each of the following functions, compute f, and determine where 
it is continuous. 
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11.1.3. 


11.1.4. 


11.1.5. 


11.1.6. 


Differentiability on R” 


xe ey? 
a) fa,yelpay WF) 


y 
0 (x, y) = (0, 0) 


x= *” 9) #00) 

>_> x, ’ 
b) f@,y)= 1x2 + y2 4 

0 (x, y) = (0, 0) 

Suppose that r > 0, that a € R”, and that f: B,(a) > R”. If all first- 
order partial derivatives of f exist on B,(a) and satisfy fx (x) = 0 for 
all x € B,(a) and all j = 1,2,...,n, prove that f has only one value on 
B,(a). 
Suppose that H = [a,b] x [c,d] is a rectangle, that f : H > Ris 
continuous, and that g : [a, b] > Ris integrable. Prove that 


b 
F(y) =| g(x) f(x, y) dx 


is uniformly continuous on [c, d]. 
Evaluate each of the following expressions. 


1 
. 32.9 
a) lim | e* *** dx 
y>0 Jo 


1 
b) al cos(e*y—y*+2—e*)dx aty=1 
dy Jo 


a 3 
c) ~| (xety+e—2dz at.» =(D 
ax 1 


Suppose that f is a continuous real function. 


a) If ih f (x) dx = 1, find the exact value of 
7 7 x7 y24x 
lim f(x — 2)e* % 7 dx. 
y>0 Jo 
b) If f isC! on R and ie f'(x) cosxdx = e, find the exact value of 
TT 
tim, f f(x) sin(y’ + Yy+x)dx-e. 
y>0 Jo 
c) If i f(/x)e* dx = 8, find the exact value of 


d 3 
a fet” dy atx =0. 
x JO 
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*11.1.7. Evaluate each of the following expressions. 


1 


sin 
a) lim f 22%S®Y ay 
yo0t Jo Yxty—1 
d CO 4—-xy 
b) / Be SEE aty=1 
dy Jn x 


*11.1.8. a) Prove that 


[ sin(x? + y? + V2) 
dx 
0 


ie 


converges uniformly on (—oo, 00). 

b) Prove that ,° e~*dx converges uniformly on [1, 00). 

c) Prove that lee ye~*Ydx exists for each y € [0, 00) and converges 
uniformly on any [a, b] Cc (0, co) but that it does not converge uni- 


formly on [0, 1]. 


*11.10 Definition. 


The Laplace transform of a function f : (0, 00) > R is said to exist at a point 


s € (0, oo) if and only if the integral 


LAG [ en F(t) dt 


converges. (Note: This integral is improper at oo and may be improper at 0.) 


*11.1.9. Prove that 


a) L{l}(s) = a s>0 
Ss 
n! 
b) Lit" \(s) = nd? s> 0, neN 
Ss 
t 1 
c) Lf{e“V(s) = ——, s>a,aeR 
sS—a 
Ss 
d) L{cos(bt)}(s) = ape s> 0, beR 
L{sin(bt = e 0,beER 
e) {sin( JMS) = So S>U,D0E 
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*11.1.10. 


"111.11. 


Differentiability on R” 


Suppose that f : (0,cco) — R is continuous and bounded and that 
L{f} exists at some a € (0, 00). Let 


t 
P(t) = e “ f(u) du, t € (0, ow). 
0 


a) Prove that 


N N 
i ms G2) dt= (Nye SON tig a | e SO b(t) dt 
0 0 


for all N EN. 
b) Prove that the integral ie e~— g(t)dt converges uniformly on 
[b, co) for any b > a and 


i ee phd GSa ie e SO w(t) dt, S>a. 
0 0 


c) Prove that £{f} exists, is continuous on (a, oo), and satisfies 
lim L{f}(s) = 0. 
SCO 


d) Let g(t) = +f (rt) fort € (0, co). Prove that £{ f} is differentiable on 
(a, oo) and 


“c1ns) = 2A) 
S 


for all s € (a, 00). 
e) If, in addition, f’ is continuous and bounded on (0, 00), prove that 


L£(f')(s) = sL(f)(s) — f 0) 


for all s € (a, 00). 


Using Exercises 11.1.9 and 11.1.10, find the Laplace transforms of each 
of the functions te~~’, t cost, and f? sint. 


11.2 THE DEFINITION OF DIFFERENTIABILITY 


In this section we define what it means for a vector function to be differentiable 
at a point. Whatever our definition, we expect two things: If f is differentiable 
at a, then f will be continuous at a, and all first-order partial derivatives of f will 


exist at a. 


Working by analogy with the one-variable case, we guess that f is differen- 
tiable at a if and only if all its first-order partial derivatives exist at a. The fol- 
lowing example shows that this guess is wrong even when the range of f is one 
dimensional. 
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11.11 EXAMPLE. 


Prove that the first-order partial derivatives of 


X + x=0 or -—0 
f@, y= i a ? 


otherwise 
exist at (0, 0), but f is not continuous at (0, 0). 


Proof. Since 
lim f(x) =1#0= f(0,0), 


it is clear that f is not continuous at (0, 0). However, the first-order partial 
derivatives of f DO exist since 


f(A, 9) = f0,0) _ 


1 
h 


f<(0, 0) = lim 
and, similarly, (0,0) = 1. | 


Even if we restrict our attention to those functions f which are continuous and 
have first-order partial derivatives, we still cannot be sure that f is differentiable 
(see Exercise 11.2.7). How, then, shall we define differentiability in R"? 

When a mathematical analogy breaks down, it is often helpful to reformulate 
the problem in its original setting. For functions of one variable, we found that f 
is differentiable at a if and only if there is a linear function T € £(R; R) such that 


ii flath)— fla)—Tth) _ 
im = 
h->0 h 


0 


(see Theorem 4.3). Thus f is differentiable at a € R if and only if there is a 
T € L(R; R) such that the function e(h) := f(a+h) — f(a —T (A) converges to 
zero so fast that e(h)/h > 0 ash — 0. This leads us to the following definition. 


11.12 Definition. 
Suppose that a € R”, that V is an open set containing a, and thatf: V > R”. 


i) fis said to be differentiable at aif and only if there is a T € £(R”; R”) such 
that the function 
é(h) := f(a+ h) — f(a) — Tih) 


(defined for ||h|| sufficiently small) satisfies ¢(h)/||h|| > 0 ash > 0. 
ii) fis said to be differentiable on a set E if and only if E is nonempty and f is 
differentiable at every point in E. 


We shall see (Theorem 11.14) that if f is differentiable at a, then there is 
only one T that satisfies Definition 11.12. Its representing m x n matrix (see 
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Theorem 8.15) is called the total derivative of f (“total” as opposed to partial 
derivatives of f) and will be denoted by Df(a). 

The following result shows that if fis differentiable, then it is continuous. Thus 
differentiability, as defined in 11.12, rules out pathology such as Example 11.11. 


11.13 Theorem. [fa vector function f is differentiable at a, then f is continuous 
at a. 


Proof. Suppose that f is differentiable at a. Then by Definition 11.12 there 
exist a T € £(R"; R”) anda 6é > 0 such that ||f(a+h) — f(a) — T(h)|| < ||h|| for 
all ||h|| < 6. By the triangle inequality (Theorem 8.6iii) and the definition of 
the operator norm (see Theorem 8.17), it follows that 


||f(a + h) — f(a) |] < [Tl [Ibi] + [hI] 


for ||h|| < 6. Since ||T|| is a finite real number, we conclude from the Squeeze 
Theorem that f(a + h) > f(a) ash > 0 (i.e., f is continuous at a). | 


If f is differentiable at a, is there an easy way to compute the total derivative 
Df(a)? The following result shows that the answer to this question is yes. 


11.14 Theorem. Let f be a vector function. If f is differentiable at a, then all 
first-order partial derivatives of f exist at a. Moreover, the total derivative of f ata 
is unique and can be computed by 


a a 
Fa ... “Ae 
af Ox, OX 
Df(a) = | = oS 
J mxn Ofin Ofm 


Proof. Since f is differentiable, we know that there is an m x n matrix B := 
[b;;] such that 


f(a + h) — f(a) — Bh 
||| 


>0 ash + 0. (3) 


Fix 1 < j <nand seth = fe; for some t > 0. Since ||h|| = tr, we have 


f(a + h) — f(a) — Bh a f(a + te;) — f(a) 
||| a t 


Be;. 


Take the limit of this identity as t > 0+, using (3) and the definition of matrix 
multiplication. We obtain 


. f(a + te;) — f(a) 


= Be; = (b1;,...,Dmj)- 
nae ae t J (1; mj) 
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A similar argument shows that the limit of this quotient as tf + O— also exists 
and equals (b1;,..., bmj). Since a vector function converges if and only if its 
components converge (see Theorem 9.41), it follows that the first-order par- 
tial derivative of each component f; with respect to x; exists at a and satisfies 


Ofi 


OX; 


(a) = bij 


fori = 1,2,...,m. In particular, for each differentiable f and each point a, 
there is only one T that satisfies Definition 11.12, and its representing matrix is 


Ofi 
Df(a) := [bij lmxn = E «@)| : a 
J mxn 


The fact that for each function f differentiable at a, there is only one matrix B 
which satisfies (3) will be referred to as the uniqueness of the total derivative. 

By Theorem 11.14 and the uniqueness of the total derivative, we now have 
several ways to find out whether a given vector function f is differentiable. 
Indeed, f is differentiable at a point a if and only if there exists an m x n matrix 
B such that 


f(a +h) — f(a) — Bh _ 


h—0 ||| 2 
if and only if 
_ [f(a +h) — f(a) — Bh 
lim = 0, 
h>0 [| hI 
or if and only if 
f h) — f(a) — Df(a)(h 
im (a+ h) — f(a) (a)(h) _ 0. (4) 
h>0 [| hI 


We shall use these three descriptions interchangeably. The advantage of the first 
two conditions is that they can be applied without computing partial derivatives 
of f (e.g., see the proofs of Theorems 11.20 and 11.28). The last condition is more 
concrete but can only be used if you can compute the first-order partial deriva- 
tives of f at a (see Example 11.18). Notice that existence of these partial deriva- 
tives is NOT enough to conclude that f is differentiable (see Exercise 11.2.7). 

Abusing the notation a bit, if all first-order partial derivatives of function f 
exist at a point a, we shall denote the Jacobian matrix 


Ofi 


by Df(a). We shall only call it the “total derivative” when f is differentiable at a; 
that is, when f satisfies (4). 
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Ifn = 1 orm = 1, the Jacobian matrix Df is an m x 1 or 1 x n matrix and, 
hence, can be identified with a vector. Most applied mathematicians represent 
Df in these cases by different notations. For the case n = 1, 


i 2) 
Df) =|: 


fia) | 
is sometimes denoted in vector notation by 


f(a) :=(f{@,..., f,@). 
For the case m = 1, 


_ | of of 
ta) =| ww oe oa) 


is sometimes denoted in vector notation by 


a a 
Vf(a) := (saw, ess a) . 
Dal OXn 


(Vf is called the gradient of f because it identifies the direction of steepest 
ascent. For this connection and a relationship between gradients and directional 
derivatives, see Exercise 11.4.11.) 

If we strengthen the conclusion of Theorem 11.14, we can obtain a reverse 
implication. 


11.15 Theorem. Let V be open in R", leta € V, and suppose thatf: V > R". 
[f all first-order partial derivatives of f exist in V and are continuous at a, then f is 
differentiable at a. 


NOTE: These hypotheses are met if f is C! on V. 


Proof. Since a function converges if and only if each of its components con- 
verge (see Theorem 9.41), we may suppose that m = 1. By definition, then, it 
suffices to show that if f is real valued and has continuous first partial deriva- 
tives on V, then 


am LAth — f@)— Vila) bh 
im =0 
h>0 ||h]| 


Let a = (qaj,...,a,). Suppose that r > 0 is so small that B,(a) C V. Fix 
h = (fj,...,4n) 4 0 in B,(0). By telescoping and using the one-dimensional 
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Mean Value Theorem, we can choose numbers c; between aj; and a; + hj 
such that 


f(ath) — f(a) = fla, +h,...,an +hn) — f(ai,a2 +ha,...,ay + hy) 
+ +--+ f(ay,...,Q)—1,4n thn) — f(ay,...,Q) 


= ) hj=—(q,...,4j-1,€j,4j41 +hjqi,...,Qn +hn). 
Al Ox; 


Therefore, 
f(iat+h)— f(a)—Vf(a-h=h-6, (5) 


where 6 € R” is the vector with components 


of of 
oj = ae ng A iis CPAP RG Linas nce) = ee 


Since the first-order partial derivatives of f are continuous at a, 6; — 0 for 
each 1 < j <n (ie., ||6|| > 0 ash > 0). Moreover, by the Cauchy—Schwarz 
Inequality and (5), 


O< |f(a+h) — f(a) -— Vf (a) -hl Z |h - 3] 
7 || h|| || h|| 


< |S). (6) 


It follows from the Squeeze Theorem that the first quotient in (6) converges 
to 0 ash — 0. Thus f is differentiable at a by definition. 


If all first-order partial derivatives of a vector function f exist and are contin- 
uous at a point a (respectively, on an open set V), we shall call f continuously 
differentiable at a (respectively, on V). By Theorem 11.15, every continuously 
differentiable function is differentiable. In particular, every function which is C? 
on an open set V, for some | < p < o, is continuously differentiable on V. 

These results suggest the following procedure to determine whether a vector 
function f is differentiable at a point a. 


1) Compute all first-order partial derivatives of f at a. If one of these does not 
exist, then fis not differentiable at a (Theorem 11.14). 

2) If all first-order partial derivatives exist and are continuous at a, then f is 
differentiable at a (Theorem 11.15). 

3) If the first-order partial derivatives of f exist but one of them fails to be 
continuous at a, then use the definition of differentiability directly. By the 
uniqueness of the total derivative, this will involve trying to verify (3) or (4). 
A decision about whether this limit exists and equals zero will involve meth- 
ods outlined in Section 9.6. 


We close with some examples. 
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11.16 EXAMPLE. 
Is f(x, y) = (cos(xy), Inx — e) differentiable at (1, 1)? 


Solution. Since f, = (—ysin(xy), 1/x) and fy = (—x sin(xy), —e”) both exist 
and are continuous at any (x, y) € R? with x > 0, fis differentiable at any such 
(x, y), in particular, at (1, 1). | 


11.17 EXAMPLE. 


Is 


y? 


f@, y= 1x24 y2 (x, y) # (0, 0) 
o (x, y) = (0, 0) 
differentiable at (0, 0)? 


Solution. Again we begin by looking at the first-order partial derivatives of f. 
By the one-dimensional Quotient Rule, if (x, y) 4 (0, 0), then 


of 9) 2xy? 
—(x =-— 

ap (x2 + y)2 

To see whether the partial derivatives exist at (0, 0), we apply the definition of 
J. directly: 


Ff 0, 0) = lim fis FO”) = lim 2 = 0. 
Ox h->0 h ho>oh 
Thus f,(0, 0) = 0 DOES exist even though the formula approach above crashed. 
Notice, then, that we cannot rely on the rules of differentiation alone to compute 
partial derivatives. 

What about f, (0, 0)? Again, we use the definition of f,, not the formula: 


Fen, FOM=FO0 4 1 
ga ae 


Since this last limit does not exist, f;(0,0) does not exist. Hence f cannot be 
differentiable at (0, 0). a 


Our final example shows that the converse of Theorem 11.15 is false. 


11.18 EXAMPLE. 


Prove that 
G+y)sn— Gy) £0) 
f,y= Vx24+ y2 
0 (x, y) = (0, 0) 


is differentiable on R* but not continuously differentiable at (0, 0). 
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Proof. If (x, y) # (0,0), then we can use the one-dimensional Product Rule 
to verify that both f, and f, exist and are continuous, for example, 


1 : 
+ 2x sin 


—x 1 
cos . 
Vx2 + y? Vx2 + y? /x2 + y2 


Thus f is differentiable on R* \ {(0, 0)}. Since f,(x, 0) has no limit as x > 0, 

the partial derivative f, is not continuous at (0, 0). A similar statement holds 

for f. Thus to check differentiability at (0,0) we must return to the definition. 
First, we compute the partial derivatives at (0, 0). By definition, 


t,0) — f(0,0 ; ol 
Fx (0, 0) = lim SO dase BAUD = limt sin — = 0, 
t-0 t t-0 |t| 


fc (x, y) = 


and similarly, f,(0,0) = 0. Thus, both first partials exist at (0, 0) and 
V f (0, 0) = (0, 0). 

To prove that f is differentiable at (0, 0), we must verify (4) for a = (0, 0). 
But it is clear that 


h,k) — —V -(h,k 1 
fh, k) — fO, 0) — V FO, 0) OD 2 Jae ak ce 
I|(h, k) h h? + k* 
as (h,k) > (0,0). Thus f is differentiable at (0, 0). _| 
EXERCISES 
11.2.1. Suppose, for j = 1,2,...,n, that f; are real functions continuously 


differentiable on the interval (—1, 1). Prove that 


8(X) = fii) +++ fr%n) 


is differentiable on the cube (—1, 1) x (—1, 1) x --. x (-1, 1). 

11.2.2. Suppose that f, g: R — R” are differentiable at a and there is a5 > 0 
such that g(x) # 0 for all 0 < |x —a| < 6. If f(a) = g(a) = 0 and 
Dg(a) 4 0, prove that 


If@) | DF@ I 
im = 7 
xa |[g(x)||_ || Dg(a) | 


11.2.3. Prove that f(x, y) = ./|xy| is not differentiable at (0, 0). 
11.2.4. Prove that 


a 0< lla, yl <2 
x, 
ff, ¥y) = 1 1 —cos Vx? — 3y? ‘ 
0 (x, y) = (0,0) 


is not differentiable at (0, 0). 


416 Chapter 11 Differentiability on R” 


11.2.5. Prove that 


mes 0,0 
f(x,y) = 4 (2x2 + 2y2)# (x, y) # (0,0) 
0 (x, y) = (0,0) 


is differentiable on R? for all w < 3/2. 
11.2.6. Prove that if a > 1/2, then 


_ [iyi loge? +9?) @ y) £0) 


f(x,y) 0 (x, y) = (0, 0) 


is differentiable at (0, 0). 
11.2.7. Prove that 


x? aye 
rime EY 
(x, y) = (0, 0) 


is continuous on R? and has first-order partial derivatives everywhere 
on R’, but f is not differentiable at (0, 0). 
11.2.8 |. This exercise is used several times in this chapter and the next. Sup- 
pose that T € £(R"; R”). Prove that T is differentiable everywhere on 
R’ with 


DT(a)=T for aecR”. 


11.2.9. Letr > 0, f : B,(0) — R, and suppose that there exists ana > 1 such 
that | f(x)| < ||x||® for all x € B,(0). Prove that f is differentiable at 0. 
What happens to this result when a = 1? 
11.2.10. Let V be open in R", ae V, andf: V > R”. 


*11.19 Definition. 


If u is a unit vector in R” (i.e., ||ul| = 1), then the directional derivative of f at a 
in the direction u is defined by 
f(a + tu) — f(a) 


Dyf(a) := lim 


when this limit exists. 


a) Prove that Dyf(a) exists for u = e, if and only if f,, (a) exists, in which case 
of 
De, f(a) = —(a). 
ex (a) ax; (a) 
b) Show that if f has directional derivatives at a in all directions u, then the first- 


order partial derivatives of f exist at a. Use Example 11.11 to show that the 
converse of this statement is false. 
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c) Prove that the directional derivatives of 


xy 
(Ned wag: TO 


0 (x, y) = (0, 0) 


exist at (0, 0) in all directions u, but f is neither continuous nor differentiable 
at (0, 0). 
11.2.11. Let r > 0, (a,b) € R’, f : B-(a,b) > R, and suppose that the first- 


order partial derivatives f, and fy exist in B,(a, b) and are differen- 
tiable at (a, b). 


a) Set A(h) = f(ath, b+h)— f(ath, b) — f(a,b+h)+ f(a, b) and 
prove for h sufficiently small that 


Ay fyla+h,b+th) — fy(a,b) — V fy(a, b) - (h, th) 


h 
— (fy(a,b + th) — fila, b) —V fy(a, b) « 0, th) + hfyx(a, b) 


for some t € (0, 1). 
b) Prove that 


_ Ah) 
Hin Sor = fyx(, b). 
c) Prove that 
a2 2 
i (a, b) i (a, b) 
0x dy dyad 


11.3. DERIVATIVES, DIFFERENTIALS, AND TANGENT PLANES 


In this section we begin to explore the analogy between Df and f’. First we 
examine how the total derivative interacts with the algebra of functions. 


11.20 Theorem. Leta € R, a € R", and suppose that f and g are vector func- 
tions. If f and g are differentiable at a, thenf+g, of, and f-g are all differentiable 
at a. In fact, 


D(f + g)(a) = Df(a) + Dg(a), (7) 
D(af)(a) = aDf(a), (8) 

and 
D(f- g)(a) = g(a) Df(a) + f(a) Dg(a). (9) 


[The sums which appear on the right side of (7) and (9) represent matrix 
addition, and the products which appear on the right side of (9) represent matrix 
multiplication. ] 
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Proof. The proofs of these rules are similar. We provide the details only for 
the hardest of them, (9). Let 


T = g(a) Df(a) + f(a) Dg(a). (10) 


Since g(a) and f(a) are 1 x m matrices, and Df(a) and Dg(a) are m xn matrices, 
T is a 1 x n matrix, the right size for the total derivative of f- g. By the 
uniqueness of the total derivative, we need only show that 


im C D@+D — -g@) — TH) _ 
im = 
h>0 ||h]| 


0. 


Since by (10), 


(f- g)(a+ h) — (f- g)(a) — T(h) 
= (f- g)(a+h) — (f- g)(a) — g(a) Df(a)(h) — f(a) Dg(a)(h) 
= (f(a + h) — f(a) — Df(a)(h)) - g(a + h) 
+ (Df(a)(h)) - (g(a + h) — g(a)) 
+ f(a) - (g(a + h) — g(a) — Dg(a)(h)) 
=: T;(h) + 7o(h) + 73(h), 


it suffices to verify T;(h)/||h|| > 0 ash — Ofor j = 1,2, 3. 

Set e(h) = f(a+h) — f(a) — Df(a)(h) and 5(h) = g(a+h) — g(a) — Dg(a)(h) 
for ||h|| sufficiently small. Since f and g are differentiable at a, we know that 
é(h)/||h|| and 5(h)/||h|| both converge to zero ash > 0. 

To estimate 7), use the Cauchy—Schwarz Inequality and the definition of ¢ 
to verify 


|T; (h)| < [f(a + h) — f(a) — Df(a)(h)|| Iga + b)|| = |le(h)|| g(a + bh) I. 


Since g is continuous at a (Theorem 11.13) and e(h)/||h|| ~ O ash — 0, 
it follows that 7,(h)/||h|| ~ 0 ash — 0. A similar argument shows that 
T3(h)/||h|| > O ash > 0. 

To estimate 7, observe by the Cauchy—Schwarz Inequality and the defini- 
tion of the operator norm (see Theorem 8.17) that 


|72(h)| < || Df(a) (h)|| |lg(a + h) — g(a)|l < || D£(@)|| Ih] [lg(a + h) — g(a)||. 


Thus |7>(h)|/||h|| < || Df(a)|| |lg(a + h) — g(a)|| — 0 ash — 0. We conclude 
that f - g is differentiable at a and its total derivative is T. a 


Formula (7) is called the Sum Rule; (8) is sometimes called the Homogeneous 
Rule; and (9) is called the Dot Product Rule. (We note that a quotient rule also 
holds for real-valued functions; see Exercise 11.3.6.) 

Continuing to explore the analogy between Df and f’, let g be a real function 
and f be a real-valued vector function. We know that g is differentiable at a 
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point a if and only if the curve y = g(x) has a unique tangent line at (a, g(a)), 
in which case g’(a) is the slope of that tangent line. What happens in the mul- 
tidimensional case? Working by analogy, we expect that f is differentiable at 
a point a € R” if and only if the surface z = f(x) has a unique tangent hyper- 
plane at the point (a, f(a)) := (a1, ...,4n, f(M1,---,4n)) € R"+!. Moreover, we 
expect that the normal vector n of that tangent hyperplane is somehow related 
to the total derivative V f(a). We shall show that both of these expectations are 
correct, and that the relationship between n and V f (a) is a simple one [see (12) 
below and Exercise 11.6.9b]. Thus, for the case m = 1, Definition 11.12 captures 
both the analytic and geometric spirit of the one-dimensional derivative. 
First we define what we mean by a tangent hyperplane. 


11.21 Definition. 


Let S be a subset of R” andc € S. A hyperplane /7 with normal nis said to be 
tangent to S at cif and only if /7 contains e and 


fase — (11) 
k>oo lex = ral 


for all sequences c, € S\ {ce} which converge to ec. 


9h, k 
ya —~ II, (a, b) 


= _ _ z=f(@% y) 
1 (a, b) 
(ath, b+k) 
FIGURE 11.1 


Definition 11.21 is illustrated for the case n = 3 in Figure 11.1. There S is the 
surface z = f(x, y), c= (a,b, f(a, b)), and 6) , represents the angle between n 
and the vector from ec to (a+h, b+k), f(a+h, b+k). Notice, by (2) in Section 8.1, 
that (11) is equivalent to assuming that the angle between n and cj —e converges 
to 2/2 for all sequences cx, € S\ {c} which converge to c. Hence the definition of 
a “tangent hyperplane” makes geometric sense. 

It is easy to see that surfaces generated by differentiable vector functions have 
tangent hyperplanes. 
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11.22 Theorem. Suppose that V is open in R", thata € V, and that f:V > R. 
[f f is differentiable at a, then the surface 


S:= {(x,z) €R"*! = z= f(x) and xeV} 
has a tangent hyperplane at (a, f (a)) with normal 
n= (V f(a), —1) := (fr, (a), fro (@), ---, fa, (a), —1)- (12) 
An equation of this tangent hyperplane is given by 
z= f(a)+ Vf(a)- (x—a). (13) 
Proof. Let ce, € S with ce, 4 (a, f(a)) and suppose that ce, — (a, f(a)) as 


k — oo. Then cx = (ax, f(ax)) for some ag € V and a, > aask — oo. For 
\|h|| small, set e(h) = f(a+h) — f(a) — V f(a) - hand define n by (12). Since 


Ilex — el] = lla —all? + | f (ax) — f (al? = llae — all, 
it is clear by (12) that 


|e(ax — a)| 
lla, — all 


| Cc —¢ 
Ilex — el] 


Since e(h)/||h|| > 0 as h — 0, it follows from the Squeeze Theorem that n 
satisfies (11) for ¢ := (a, f(a)). Finally, (13) is an equation of this tangent 
hyperplane by the point-normal form. | 


Thus for the case n = 2, if f is differentiable at (a, b), then an equation of that 
tangent plane to z = f(x, y) at (a, b) is given by 


z= f(a,b) + —a) fra, b) + (y — b) AG, B). (14) 


Notice that (14) is completely analogous to the one variable case. Namely, if 
g is differentiable at a, then the tangent line to y = g(x) at the point (a, g(a)) is 


y=g(a)+8'@a—a). (15) 


It is interesting to note that the converse of Theorem 11.22 is also true (see 
Theorem 11.27 below). 

There is another analogy between Df and f’ worth mentioning. Recall that if 
f is areal function, then the change in y = f(x) as x moves from a to a + Ax is 
defined by Ay = f(a+Ax)-— f(a). For many concrete situations, it is convenient 
and useful to approximate Ay by the Leibnizian differential dy := f’(a) dx, 
where dx = Ax is a small real number (see Figure 11.2). Does a similar situation 
prevail for functions on R”? 
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AY 
y=f) 


Ay 


Tangent at (a, f(a)) 


#Y 


a at+Ax 


FIGURE 11.2 


To answer this question, suppose that z = f(x) is a vector function from n 
variables to one variable, differentiable at a; that Az := f(a + Ax) — f(a), 
where Ax := (Ax),..., Ax,); and that dx = Ax is a vector with small norm. 
Comparing (14) and (15), we define the first total differential of a vector function 
from n variables to one variable to be 


dz:=Vf(a)-Ax:= >> La dry. 


jae 


Is dz a good approximation to Az? 


11.23 Remark. Let f : R” — R be differentiable at a and Ax = 
(Ax,,..., Ax,). Then 


Az—dz 
|| Ax|| 


as Ax— 0. 


In particular, the differential dz approximates Az. 


Proof. By definition, if f is differentiable at a, then e(h) := f(a+h) — f(a) — 
V f(a) - h satisfies e(h)/||h|| > 0 ash — 0. Since Az = f(a+h) — f(a) for 
h := Ax and dz = V f (a) - h, it follows that (Az — dz)/||Ax|| > 0as Ax—> 0. Hf 


Figure 11.2 gives us a useful geometric interpretation of the one-dimensional 
differential. Is there an analogous interpretation for two-dimensional differen- 
tials? Specifically, if z = f(x, y), does the total differential dz and the increment 
Az play an analogous geometric role in R* that dy and Ay played in R?? 

The two-dimensional picture corresponding to Figure 11.2 involves a tangent 
plane and a wedge-shaped region (see Figure 11.3). Namely, let zo = f(a, b) and 
consider the wedge-shaped region W with vertical sides parallel to the xz- and 
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z=f(~%y) 


a 
y 
<< | b+ Ay) 


(a + Ax, b + Ay) 


FIGURE 11.3 


yz-planes whose base has vertices ¢9 := (a, b, Zo), C1 := (a+ Ax, b, zo), C2 := 
(a, b+Ay, zo), €3 := (a+ Ax, b+Ay, Zo), and whose top is tangent to z = f(x, y) 
at cg. Let A represent the length of the vertical edge of W based at c;, B the 
length of the edge based at co, and C the length of the edge based at c3. If dz is to 
play the same role in Figure 11.3 that dy plays in Figure 11.2, then it must be the 
case that C = dz. This is actually easy to verify. Since the diagonals of rectangles 
bisect one another, the line segment from the intersection of the diagonals in 
the base of W to the intersection of the diagonals in the top of W must be 
parallel to the z-axis. Thus the length D of this line segment can be computed 
two ways. On the one hand, D = C/2. On the other hand, D = (A + B)/2. 
Therefore, C = A+ B. But from one-dimensional calculus, A = f(a, b) dx and 
B = fy(a, b) dy. Consequently, 


C) 0 
aE a, b)dx + ae. b) dy = dz. 
Ox oy 


C=A+B= 
We conclude that the first total differential of vector functions plays exactly the 
same role that it did for real functions. 


We close this section with some optional material about tangent planes and 
applications of the first total differential. 


Notice by Remark 11.23 that if f is differentiable at a, then the total differ- 
ential of f can be used to approximate the change of f as x moves from a to 
a+h for ||h|| sufficiently small. This suggests that the differential can be used to 
approximate a function. 
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*11.24 EXAMPLE. 


Use differentials to approximate the change of f(x,y) = x*y — y? as (x, y) 
moves from (0, 1) to (0.02, 1.01). 


Solution. Let z = x*y — y*, a = 0, and b = 1. Then dx = 0.02 and dy = 0.01. 
Since dz = 2xy dx + (x* — 3y) dy, we have 


Az © 0(0.02) — 3(0.01) = —0.03. 
Note that Az= f (0.02, 1.01) — f (0, 1) =—0.029897... is very close to —0.03. 


*11.25 EXAMPLE. 
Use differentials to approximate (5.97) /16.03. 


Solution. Let z = yx, a = 16, and b = 6. Then dx = 0.03 and dy = 
—0.03. Since 


dz= dy 4 ¥x dy, 


4x3 
we have 
6(0.03 
ze os) + /16(—0.03) + —0.054375. 
4,./(16)3 
Thus, 


z& 6V16 — 0.054375 = 11.945625. 


Note that the actual value of 5.97\/16.03 is 11.945593.... Thus our approxima- 
tion is good to three decimal places. | 


*11.26 EXAMPLE. 


Find the maximum percentage error for the calculated value of the volume of 
a right circular cylinder if the radius can be measured with a maximum error of 
3% and the altitude can be measured with a maximum error of 2%. 


Solution. The volume of a right circular cylinder is V = mr? h, where r is the 
radius and h is the altitude. Hence, the differential of V is dV = 2mrh dr + 
mr dh. Thus 


= p eee 
V r h 

Since the percentage error of a variable x is Ax/x ~ dx/x, it follows that the 

maximum percentage error in calculating the volume V is approximately 8%: 


dV 
v= 2(+0.03) + (+0.02) = +0.08. a 
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We close this section by showing that the converse of Theorem 11.22 holds. 
(The proof presented here is based on Taylor [13]!.) 


*11.27 Theorem. Let V be open in R’, let (a,b) € V, and let f : V > R. Then f 
is differentiable at (a, b) if and only if z = f (x, y) has a nonvertical tangent plane 
IT ate := (a,b, f(a, b)), in which case IT = ITy(c) for 


n= (f,(a, b), fy(a, b), —1). (16) 


Proof. If f is differentiable at (a, b), then by Theorem 11.22, z = f(x, y) has 
a nonvertical tangent plane with normal given by (16). 

Conversely, suppose that the surface S$ := {(x,y,z) : z = f(x,y) for 
(x, y) € V} has a nonvertical tangent plane /7 at ec with normal N whose third 


component is y. Since y 4 0, n:= —N/y =: (1, 72, —1) is also a normal of 
TT, Let nj,2 = (11, n2), let a := (a, b), and let h := (h, k) 4 0 be chosen so that 
a+he V. Set 


é(h) = Az— ni 2° h, 


where Az := f(a +h,b+k) — f(a,b), and observe (by the uniqueness of 
the total derivative) that if we prove that e(h)/||h|| > 0 ash — 0, then f is 
differentiable at a, and n;,2 = V f(a, b) as required. 

Set ch := (a +h, b+k, flat+th,b+k)). Since /7 is tangent to S at e, and 
since n- (Ch — ¢) = nj ,2-h— Az, Definition 11.21 implies 


n2-h— Az 


0 := 0) = SS > 
I|hh||? + (Az)? 


(17) 


as h — 0. Since the expression Q defined in (17) is a quadratic in Az, use the 
quadratic formula to solve for Az: 


—njo-h+ |Q|/(m12-h)? + (1 — Q?)||hI|/? 


a= 18 
: O2-1 (18) 
It follows that 
— hoz Thy? + (1 — ©2)Ihll2 
e(h) = Az —my9-h = —2E2 BO’ + 1Oly m2 bY +A— OH IbI Gg) 


Ori 


To estimate e(h), observe by the Cauchy—Schwarz Inequality that |n;,2 - h| < 
lIm1,2|| [hj] and that (mj,2 - hb)? + (1 — Q?)|[hI|* < ((lm,2//7 + 1 — Q) hI. 
Substituting these estimates into (19), we obtain 


leh) _ QO? Wim ail + 1Ol Vim aI? + 1 = O? 
hy ~ 1-@ ' 


Angus E. Taylor, Advanced Calculus (Boston: Ginn and Company, 1955). 
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Since Q — O ash — 0, we conclude by the Squeeze Theorem that 
é(h)/||h|| > O0ash — 0. In particular, f is differentiable at (a,b) and 


(f(a, b), fy(a, b), —1) is normal to its tangent plane there. |_| 
EXERCISES 
11.3.1. For each of the following, prove that f and g are differentiable on their 


11.3.2. 


11.3.3. 


11.3.4. 


11.3.5. 
11.3.6. 


domains, and find formulas for D(f + g)(x) and D(f - g)(x). 
a) f(x,y) =2x—4y, g(x,y)= Oe y? 

b) f(x,y) =3xy, g(x, y) =sinx — ycosy 

c) f(x, y) = (sin(xy), ylogx), g(x, y) = (y, x) 

d) f(x, y,z)=(-y,x+2), 80,y,0=@+yt+z, xyz) 


For each of the following functions, find an equation of the tangent 
plane to z = f(x, y) ate. 


a) f(x, y) =x? +2y%, c= (—1, 1,3) 
B) Gay) Say ry C= 15,0) 
c) f(x, y,z) =xz+cosy, e= (1,7,0, -1) 


Find all points on the paraboloid z = x* + y* (see Appendix D) where 
the tangent plane is parallel to the plane x + y+ z = 1. Find equa- 
tions of the corresponding tangent planes. Sketch the graphs of these 
functions to see that your answer agrees with your intuition. 


Let K be the cone, given by z = /x* + y?. 


a) Find an equation of each plane tangent to K which is perpendicular 
to the plane x + z =5. 

b) Find an equation of each plane tangent to K which is parallel to the 
plane x —y+z=1. 


Prove (7) and (8) in Theorem 11.20. 
[QUOTIENT RULE]. Suppose that f : R” — R is differentiable at a and 
that f(a) 4 0. 


a) Show that for ||h|| sufficiently small, f(a +h) 4 0. 
b) Prove that Df (a)(h)/||h|| is bounded for all h € R"\ {0}. 
c) If T := —Df (a)/f?(a), show that 


1 L pay — £@ = f@+h + FM) 
f(ath) f(a) f(a) f(a+h) 
_(f@th) — f@)Dfah) 
| f(a) f(a +h) 


for ||h|| sufficiently small. 
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d) Prove that 1/f(x) is differentiable at x = a and 


1 _ _ Df(a) 
(7) = Fa 


e) Prove that if f and g are real-valued vector functions which are 
differentiable at some a, and if g(a) 4 0, then 


a (4) pe HAGE f(a)Dg(a) 
g g(a) 


11.3.7. [CRoss-Propuct RULE]. Suppose that V is open in R”, that f,g: V > 
R?, and that a € V. Iffand g are differentiable at a, prove that f x g is 
differentiable at a and 


Dif x g)(a)(y) = fla) x (Dg(a)(y)) — g(a) x (Df(a)(y)) 


for ally € R”. 
*11.3.8. Compute the differential of each of the following functions. 


2 2 A4xy 
a)z=4x° + 9y b) z = cos(xy) 07 FEA 
*11.3.9. Let w = y?z* — x. Use differentials to approximate Aw as (x, y, z) 
moves from (3,2,1) to (3.04, 1.98, 1.10). Compare your approximation 
with the actual value of Aw. 
*11.3.10. The time T it takes for a pendulum to complete one full swing is 
given by 


L 
P= 20), 

g 
where g is the acceleration due to gravity and L is the length of the 
pendulum. If g can be measured with a maximum error of 1%, how 
accurately must L be measured (in terms of percentage error) so that 
the calculated value of T has a maximum error of 2%? 

*11.3.11. Suppose that 


W x y z 


where each variable x, y, z can be measured with a maximum error of 
p%. Prove that the calculated value of w also has a maximum error 
of p%. 


11.4 THE CHAIN RULE 


Here is the Chain Rule for vector functions. 
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11.28 Theorem. [CHAIN RULE]. 
Suppose that f and g are vector functions. If g is differentiable at a and f is 
differentiable at g(a), then f o g is differentiable at a and 


D(f 0 g)(a) = Df(g(a)) Dg(a). (20) 
[The product Df(g(a)) Dg(a) is matrix multiplication. | 


Proof. To specify the dimensions, suppose that a €¢ R”, b := g(a) € R”, 
and f(b) € R?. Set T = Df(g(a)) Dg(a) and observe that T, the product of a 
p X m matrix with an m x n matrix, is a p x n matrix, the right size for the 
total derivative of fo g. By the uniqueness of the total derivative, we must 
show that 


f(g(a + h)) — f(g(@)) — Th) _ 


Mu) [| ° ” 
Set 
e(h) = g(a +h) — g(a) — Dg(a)(h), (22) 
and 
5(k) = f(b + k) — f(b) — Df(b) (k) (23) 


for ||h|| and ||k|| sufficiently small. By hypothesis, e(h)/||h|| — 0 in R” as 
h > 0in R’, and 6(k)/||k|| ~ 0in R? ask — 0 in R”. Fix h small and set 
k = g(a+h) — g(a). Since (23) and (22) imply 


f(g(a + h)) — f(g(a)) = f(b + k) — f(b) = Df(b)(k) + 5(k) 
= Df(b)(Dg(a)(h) + e(h)) + 5(k) 
= Tih) + Df(b)(e(h)) + 4(k), 


we have f(g(a + h)) — f(g(a)) — T(h) = Df(b)(e(h)) + 5(k) =: Ti(h) + To(h). 
It remains to verify that T;(h)/||h|| > 0 ash — 0 for j = 1, 2. 

Since e(h)/||h|| — 0 ash — 0 and Df(b)(h) is matrix multiplication, it is 
clear that T; (h)/||h|| — Df(b)(0) = 0 ash — 0. On the other hand, by (22), 
the triangle inequality, and the definition of the operator norm, we have 


I|k|| := llg(a + h) — g(a)|I = || Dg(a)(hb) + e(h)|| < || Dg(a)|l - [hl + Hle@)|l. 


Thus ||k||/||h|| is bounded for ||h|| sufficiently small. Since k = 0 implies 
||T2(h)|| = 0, we may suppose that k 4 0. Since k > 0 implies h — 0, it 
follows that 


Po(ayl| IKI 6d) 
|||] [hi| [IKI 


as h > 0. We conclude that f o g is differentiable at a and the derivative is 
Df(g(a)) Dg(a). a 


> 0 
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The Chain Rule can be used to compute individual partial derivatives with- 
out writing out the entire matrices Df and Dg. For example, suppose that 
f(,...,Um) is differentiable from R” to R, that g(x1,..., x,) is differentiable 
from R” to R”, and that z = f(g(x1,...,%)). Since Df = Vf and the jth 
column of Dg consists of first partial derivatives, with respect to x;, of the com- 
ponents uz := gx(X1,..-, Xn), it follows from the Chain Rule and the definition 
of matrix multiplication that 


dz Of du, | _ Of dum 
Ox; ~ Ouy OX j OUm OX; 
for j = 1,2,...,n. Here are two concrete examples which illustrate this 


principle. 


11.29 EXAMPLES. 
i) If F,G,H: R* —> Rare differentiable and z = F(x, y), where x = G(r, 6), 
and y = A(r, 9), then 


dz dzdx | dz dy a dz dzdx dz dy 
dr dx dr dy ar 00 dxd0 dy dd 


ii) If f : RR? > Rand ¢,y~,o : R > Rare differentiable and w = f(x, y, 2), 
where x = f(t), y= W(t), and z = o(f), then 


dw dwdx | dwdy | dwdz 
dt dxdt dydt dz dt 


EXERCISES 


11.4.1. Let F : R? > Rand f,g,4: R* > RbeC? functions. If w = F(x, y, z), 
where x = f(p,q), y = g(p,q), and z = h(p,q), find formulas for 
Wp, Wg, and Wpp. 

11.4.2. Let r > 0, leta € R”, and suppose that g : B-(a) > R” is differentiable 
at a. 


a) If f : B-(g(a)) > R is differentiable at g(a), prove that the partial 
derivatives of h = f og are given by 


oh 0 
(a) = Veta) <8 (a) 
Xj Ox; 


for j =1,2,...,n. 
b) Ifn =m andf: B,(g(a)) — R" is differentiable at g(a), prove that 


det(D(f o g)(a)) = det(Df(g(a))) det(Dg(a)). 


11.4.3. 


11.4.4. 


11.4.5. 


11.4.6. 


11.4.7. 
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Suppose that k € N and that f : R” — R is homogeneous of order 
k; that is, that f(ox) = p* f(x) for allx € R” andall p € R. If f is 
differentiable on R”, prove that 


af 
OXn 


0 
Ace to + Hew = Fw) 
Ox 
for all x = (x1,...,%,) € R". 
Let f,g : R — R be twice differentiable. Prove that u(x, y) := f(xy) 
satisfies 


and v(x, y) := f(x — y) + g(x + y) satisfies the wave equation; that is, 


a2v a2 


x2 ay? = 


Let f,g : R* > R be differentiable and satisfy the Cauchy—Riemann 
equations; that is, that 


df _ 9g af dg 
—=— and 
ax dy dy Ox 


hold on R?. If u(r, 0) = f(rcos6, r sin) and v(r, 6) = g(r cos6,r sin@), 
prove that 


au lav av au 


ae eu. & 0. 
ar or a0 eo rae. 


Let f : R*? — R be C? on R’ and set u(r, 0) = f(rcosé,rsing). If f 
satisfies the Laplace equation; that is, if 


a? a? 
arf + es — 0, 
ax2 = dy” 


prove for each r # 0 that 


102u 1ldu  d2u 


s=—s+H+- e+ Ch H= 0. 
r2 002 or Or or2 
Let 
—x?/4t 
u(x,t) = ; t>0,xeER. 
V4 t 


a) Prove that u satisfies the heat equation (i.e., ux, —u; = 0 for allt > 0 
and x € R). 
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11.4.8. 


11.4.9. 


11.4.10. 


11.4.11. 


Differentiability on R” 
b) If a > 0, prove that u(x,t) > 0, ast — 0+, uniformly for x ¢€ 


[a, &). 
Let u : R — [0, 0) be differentiable. Prove that for each (x, y, z) # 


(0, 0, 0), 
ros.y.:eu( e+e +2) 


satisfies 


(SE) +) +) <b 0) 


Suppose that z = F(x, y) is differentiable at (a, b), that Fy(a,b) 4 0, 
and that J is an open interval containing a. Prove that if f : J — Ris 
differentiable ata, f(a) =b, and F(x, f(x)) = 0 for all x € J, then 


df _ ax (a, b) 

Pree 
—(a, b) 
dy 


Suppose that J is a nonempty, open interval and that f : J] > R” is 
differentiable on J. If f(7) C 0B,(0) for some fixed r > 0, prove that 
f(t) is orthogonal to f(t) for all t € J. 

Let V be open in R", a € V, f : V — R, and suppose that f is 
differentiable at a. 


a) Prove that the directional derivative Dy f(a) exists (see Exer- 
cise 11.2.10) for each u € R” such that |lul| = 1 and Dyf(a) = 
V f(a) -u. 

b) If V f(a) 4 0 and 6 represents the angle between u and V f (a), prove 
that Dy f(a) = ||V f(a)|| cos 6. 

c) Show that as u ranges over all unit vectors in R”, the maximum of 
Dy f (a) is ||V f (a)||, and it occurs when u is parallel to V f (a). 


11.5 THE MEAN VALUE THEOREM AND TAYLOR'S FORMULA 


Using Df as a replacement for f’, we guess that the multidimensional analogue 
of the Mean Value Theorem is f(x) — f(a) = Df(c)(x — a) for some ec “between” x 
and a; that is, some c € L(x; a), the line segment from a to x. The following result 
shows that this guess is correct when f is real valued (see also Exercises 11.5.6 
and 11.5.9). 


11.30 


Theorem. [MEAN VALUE THEOREM FOR REAL VALUED 


FUNCTIONS]. 
Let V be open in R" and suppose that f : V — Ris differentiable on V. If 


xX,ae 


V and L(x; a) C V, then there is ac € L(x; a) such that 


f(x) — f(a) = Vf(©)-(x-—Aa). (24) 
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Proof. Let 
g(t) =a+t(x—a), teR, 


and notice by Exercise 11.2.8 that g: R > R’” is differentiable with Dg(t) = 
x — a for all t € R. Since L(x; a) C V and V is open, choose 6 > 0 such that 
g(t) € V for allt € Js := (—d6, 1 + 8). By the Chain Rule, 


D(f og)(t) = Df(g@)(x — a), ted. (25) 


But fog: Js > Risa real function and f is real valued, so D(f og) = (f 0g)’ 
and Df = Vf. Hence, by the one-dimensional Mean Value Theorem and 
(25), there is a tg € (0, 1) such that 


f(x) — f(a) = (f og)() — (f 0g)(0) = (f 08)'() = VFO: (x—a) 
for c = g(fo). | 


The following result shows that this result does not extend to vector-valued 
functions, even when the domain is one dimensional. 


11.31 Remark. The function f(t) = (cost, sint) is differentiable on R and satis- 
fies f (2x) = f (0), but there is no c € R such that Df (c) = (0, 0). 


Proof. Df (t) = (—sint, cost) exists and is continuous for t € R but (0,0) 4 
(—sint, cost) fort € R. | 


But any vector-valued function f can be turned into a scalar-valued function 
by taking the dot product of f with any vector u. Combining this observation 
with Theorem 11.30, we obtain the following multivariable version of the Mean 
Value Theorem. 


11.32 Theorem. [MEAN VALUE THEOREM FOR VECTOR VALUED 
FUNCTIONS]. 

Let V be open in R" and suppose that f : V — R" is differentiable on V. If 
x,a € V and L(x; a) C V, then given any u € R” there is ae € L(x; a) such 
that 


u- (f(x) — f(a)) =u- (Df(©)(K — a)). 
Proof. Let u € R”. Set h(x) := u- f(x) and observe by the Dot Product Rule 
[see (9) above] that 
Dh(x) = u- D(f)(x) 


for all x € V. Since h is scalar valued, it follows from Theorem 11.30 that 
there is ac € L(x; a) such that 


u- (f(x) — f(a)) = h(x) — h(a) = Dh(c)(x — a) = u- (Df(c)(x — a)). a 
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Sets which satisfy the hypothesis “L(x; a) C V” come up often enough to 
watrant a name. 


11.33 Definition. 


A subset EF of R” is said to be convex if and only if L(x; a) C E for allx,ae E. 


Using this terminology, we see that the Mean Value Theorems above hold for 
any C! function on a convex, open set V. 

It is easy to see that balls and rectangles are convex. For example, if x,a € 
B,(b), then 


(1 — tha + tx) —bl| = | —H(a—b)+t(x—-Db)|| < d-frt+tr=r. 


On the other hand, Figure 11.4 is an example of a nonconvex set in R* (because 
the line segment which joins a to b contains some points outside V). 


FIGURE 11.4 


As in the one-dimensional case, the Mean Value Theorem is used most often 
to obtain information about a function from properties of its derivative. Here is 
a typical example. 


11.34 Corollary. Let V be an open set in R", let H be a compact subset of V, and 
suppose that f : V —> R” is C! on V. If E is a convex subset of H, then there is a 
constant M (which depends on H and f but not on E) such that 


I|fx) — f(a) || < M||x — all 


forallx,ae E. 


Proof. Since H is compact and the entries of Df are continuous on H, we 
have by the Extreme Value Theorem (Theorem 9.57 or 10.63) and the proof 
of Theorem 8.17 that the operator norm of Df is bounded on H;; that is, that 


M := sup ||Df(c)|l 
cceH 


is finite. Notice that M depends only on H and f. 
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Let x,a € FE andu = f(x) — f(a). Since E is convex, L(x; a) C E. Hence, by 
Theorem 11.32, there is ac € L(x; a) such that 


IIf(x) — f(@)|? = u- (f(x) — f(a) = u- (Df (x — a) 
(f(x) — f(a)) - (DE(©)(« — a)). 


It follows from the Cauchy—Schwarz Inequality and the definition of the 
operator norm that 


If) — f(a) ||? < ||f(x) — f(@)|| || DE(©)|| Ix — al]. 


If ||fCx) — f(a)|| = 0, there is nothing to prove. Otherwise, we can divide the 
inequality above by ||f(x) — f(a)|| to obtain 


IIx) — f(a) || < || DE©|| lx — al] < M||x — all. a 


As the following optional result shows, for some applications of the Mean 
Value Theorem, the convexity hypothesis can be replaced by connectivity. (This 
is an analogue of the one-dimensional result: If f’ = 0 on [a,b], then f is 
constant on [a, b].) 


*11.35 Corollary. Suppose that V is open and connected in R" and thatf: V > 
R” is differentiable on V. If Df(c) = O for alle € V, then f is constant on V. 


Proof. Fix a € V, and let x € V. Since V is open and connected, V is polyg- 
onally connected (see Exercise 9.7.10 or 10.4.10). Thus, there exist points 
Xo = a,X),...,X¢ = x such that L(x;-1;xj;) C V for j = 1,2,...,k (see 
Figure 11.4). 

Let u = f(x) — f(a) and choose by Theorem 11.32 points e; € L(x;-1; xj) 
such that 


u- (f(x;) — f(xj-1)) = u- (Df(c;)(Kj — xj-1)) =0 


for j = 1,2,...,k. Summing over j and telescoping, we see by the choice of 
u that 


k 
0= Sou- (f(x;) — f(xj-1)) = u- (f(x) — f(a) = ||f6<) — fla)’. 


j=l 
Therefore, f(x) = f(a). a 


To obtain a multidimensional version of Taylor’s Formula, we need to define 
higher-order differentials. Let p > 1, let V be open in R”, let a € V, and let 
ff: VY —R. We shall say that f has a pth-order total differential at aif and only 
if the (p — 1)-st order partial derivatives of f exist on V and are differentiable 
at a, in which case we call 
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D® flash) = So ee = (ahi--:hj,, N= (h1,--.4hn) @R" 
iy: ip 


ij=1 ip= 
the pth-order total differential of f at a. Notice that 


D” f(a; h) = DY (D?~” f)(a; h) 


for p > 1. Also notice that if z = f(x), then D“) f(a; Ax) is the first total 
differential dz defined in Section 11.3, and also is the total derivative of f ata 
evaluated at Ax: 


n 
D f(a; Ax) := ae ao Ax; = Vf(a)- Ax = Df (a)(Ax). 
j= 19 
For the case n = 2, this differential has a simple geometric interpretation (see 
Figure 11.3 above). 
Although total differentials look messy to evaluate, when f is a sufficiently 


smooth function of two variables, they are relatively easy to calculate using 
binomial coefficients (see the next example and Exercise 11.5.2). 


11.36 EXAMPLE. 


Suppose that f : V > Ris C? on V. Find a formula for the second total differ- 
ential of f at (a,b) € V. 


Solution. By definition, 


(2) 2 a f af 
D® f (a, b); (h, k)) = h a 5 (a, b) 4 t hk s (a, b) 

a2 2 

af 20° f 
ee OO as x (a, b). 

But by Theorem 11.2, f,(a, b) = fyx(a, b). Therefore, 
2 
DO f(a, b); (h, k) = he La b) 4 one 2d 7 ~ (a,b) +S (a,b), a 

y 


Thus the second total differential of f(x, y) = (xy)? is 
D® f(x, y); (h, k)) = 2y7h? + Bxyhk + 2x7K. 


Here is a multidimensional version of Taylor’s Formula. 
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11.37 Theorem. [TAYLOR’S FORMULA ON R"]. 

Let p €N, let V be open in R", let x,a € V, and suppose that f : V > R. If 
the pth total differential of f exists on V and L(x; a) C V, then there is a point 
c € L(x; a) such that 


p-l 
1 1 
f(x) = flat » aD fa h) + ae re h) 


forh:=x-a 


NOTE: These hypotheses are met if V is convex and f is C? on V. 


Proof. Let h = x —a. As in the proof of Theorem 11.32, choose 6 > 0 so 
small thata+ th € V fort € Js := (—6, 1+). The function F(t) = f(a-+ th) 
is differentiable on J; and, by the Chain Rule, 


F(t) = Df(a+th)(h) = )~ ao + th) he. 
k=1 


In fact, a simple induction argument can be used to verify that 


. a/ f 
POG = ho Se th) hj, --- hi, 
() Xu 5s ie hi, hi, 
y= — @ 
for j =1,2,..., p. Thus 
FXO) = D” fash) and F(t) = D” f(a+th;h) (26) 


for j=1,...,p—l,andt ce Js. 

We have proved that the real function F has a derivative of order p 
everywhere on J; D [0, 1]. Therefore, by the one-dimensional Taylor Formula 
and (26), 


p-l 
1, 1 
f(x) — f(a) = Fd) — FO) = > — F‘)(0) he — F'P)(t) 

j=l 

p-1 1 I 

= >> =D" fh) + =D” f(a th; h) 
jou Ps 
for some t € (0, 1). Thus sete = a+ th. = 


11.38 EXAMPLE. 
Write Taylor’s Formula for f(x, y) = cos(xy), a = (0,0), and p = 3. 
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Solution. It is easy to verify that fr, f\, frx, fry, and fyy are all zero at (0, 0), 
so D f((0,0); x, y)) = 0 and D f((0, 0); (x, y)) = 0. Since frxx(x, y) = 
yesin(xy), frxy = —2y cos(xy) + xy? sin(xy), fryy = —2x cos(xy) + x7y sin(xy), 
and fyyy = x%cos(xy), Theorem 11.2 implies DS) f((c,d); (x, y)) = GF + 
y?) sin(cd) + 3(x*y + xy”) sin(cd) — 6(x + y) cos(cd). Thus by Taylor’s Formula, 
for all (x, y) € R* there is a point (c, d) on the line segment between (0, 0) and 
(x, y) such that 


3 3 2 2 
ia ) sin(cd) + (AS *) sin(cd) — (x + y) cos(cd). 


cos(xy) = 1+ ( 


EXERCISES 


11.5.1. a) Write out an expression in powers of (2x — 1) and (y + 1) for 
f(x, y) = 4x? + 2xy + y?. 
b) Write Taylor’s Formula for f(x,y) = /x + ¥y, a = (1,1), and 
p=3. 
c) Write Taylor’s Formula for f(x, y) = e*~”, a = (0,0), and p = 4. 
11.5.2. Suppose that f : R? > Ris C? on B,(xo, yo) for some r > 0. Prove 


that, given (x, y) € B,(xo, yo), there is a point (c, d) on the line segment 
between (xg, yo) and (x, y) such that 


p-l 1 k k ; ; at f 
f(x,y) = F090) + 2g (i) x0)! (Yo) SH ez Os YO) 


ere . _.  gP 
+— (?)o xo)/(y = yo)? Fe, d). 
p! 20 J dx/ Oy 


11.5.3. Suppose that f :R” > Randg: R” — R’ are differentiable on R” and 
that there exist r > 0 anda € R” such that Dg(x) is the identity matrix, 
I, for all x € B,(a). Prove that there is a function h : B,(a) \ {a} > 
B,(x) such that 


If (g(x) — f(g(a))| 


\|x — all 


< |DF(goh)))I| 


for all x € B,(a) \ {a}. 

11.5.4. Suppose that V is convex and open in R” and that f : V > R” is dif- 
ferentiable on V. If there exists ana € V such that Df(x) = Df(a) for 
all x € V, prove that there exist a linear function S « £(R”; R”) anda 
vector c € R” such that f(x) = S(x) +c for allx € V. 

11.5.5. [INTEGRAL ForM OF TayLor’s ForMuLA]. Let p € N, V be an open set in 
R", x,ae V,and f: V > RbeC? on V. If L(x; a) Cc V andh=x~—a, 
prove that 


11.5.6. 


11.5.7. 


11.5.8. 


11.5.9. 


11.5.10. 


11.5.11. 
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p-1 I 


1 1 
—f(a)=)> —D® f(a;h — | 1-1)?! p®) h; h) dt. 
f(x)—f (a) ars fla Ga a t) f(a+th; h) dt 


Letr > 0, a,b € R, f : B,(a,b) > R be differentiable, and (x, y) € 
B,(a, b). 


a) Let g(t) = f(tx + (1 —n)a, y)+ f(a, ty + d — 2b) and compute the 
derivative of g. 

b) Prove that there are numbers c between a and x, and d between b 
and y such that 


fay-f@ab=a-aAfl y+ -b)fG 4d). 


(This is Exercise 12.20 in Apostol [1].) 


Suppose that 0 <r < 1 and that f : B,;(0) — R is continuously differ- 
entiable. If there is an a > O such that | f(x)| < ||x||* for all x € B,(0), 
prove that there is an M > 0 such that | f(x)| < M||x|| for x € B,(0). 
Suppose that V is open in R”, that f : V > R is C? on V, and that 
fx; (a) = 0 for some a € H and all j = 1,...,n. Prove that if H isa 
compact convex subset of V, then there is a constant M such that 


lf (x) — f(a)| < M||x— all? 


for allx € H. 

Let f : R” — R. Suppose that for each unit vector u € R”, the direc- 
tional derivative Dy f (a+ tu) exists for t € [0, 1] (see Definition 11.19). 
Prove that 


f(a+u)— f(a) = Duf(at tu) 
for some t € (0, 1). 
Suppose that V is open in R’, that (a,b) € V, and that f : V > RisC? 
on V. Prove that 


20 
lim —= f(atrcos6, b+r sin 6) cos(20) dO = frx(a, b) — fyy(a, b). 
0 


Suppose that V is open in R’, that H =[a, b]x[0,c] C V,thatu: V>R 
is C* on V, and that u(xo, to) > 0 for all (xo, fo) € OH. 


a) Show that, given ¢ > 0, there is a compact set K C H? such that 
u(x,t) > —e forall (x,t)e H\ K. 

b) Suppose that u(x, t;) = —€ < 0 for some (x1, t)) € H°, and choose 
r > 0so small that 2rt; < €. Apply part a) toe := ¢/2 —rt, to 
choose the compact set K, and prove that the minimum of 


w(x, t) := u(x,t) +r(t — th) 


on A occurs at some (x2, f2) € K. 
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c) Prove that if u satisfies the heat equation (i.e., ux, — uy = Oon V), 
and if u(xo, to) > O for all (xo, %) € OH, then u(x,t) > 0 for all 
(x,t) eH. 


11.5.12. a) Prove that every convex set in R” is connected. 
b) Show that the converse of part a) is false. 
*c) Suppose that f : R — R. Prove that f is convex (as a function) if 
and only if E := {(x, y): y > f(x)} is convex (as a set in R”). 


11.6 THE INVERSE FUNCTION THEOREM 


By the one-dimensional Inverse Function Theorem (Theorem 4.33), if g:R->R 
is 1-1 and differentiable with g’(xo) 4 0, then g~! is differentiable at yo = 
g(xo) and 


(g¢~')'(yo) = 


8' (Xo) 

In this section we obtain a multivariable analogue of this result (i.e., an Inverse 
Function Theorem for vector functions f from n variables to n variables). What 
shall we use for hypotheses? We needed g to be 1-1 so that the inverse function 
g~! existed. For the same reason, we shall assume that f is 1-1. We needed 
g'(xo) to be nonzero so that we could divide by it. In the multidimensional case, 
Df(a) is a matrix; hence “divisibility” corresponds to invertibility. Since ann xn 
matrix is invertible if and only if it has a nonzero determinant (see Appendix C), 
we Shall assume that the Jacobian of f 


Ag(a) := det(Df(a)) 0. 


The word Jacobian is used because it was Jacobi who first recognized the impor- 
tance of A¢ and its connection with volume (see Exercise 12.4.6). 

The proof of the Inverse Function Theorem on R” is not simple and lies some- 
what deeper than the preceding results of this chapter. Before presenting it, 
we first prove two preliminary results which explore the consequences of the 
hypothesis Arp 4 0. 


11.39 Lemma. 

Let V be open and nonempty in R" and let f : V — R" be continuous. If f is 
1-1 and has first-order partial derivatives on V, and if Ag # 0 on V, then f~' is 
continuous on f(V). 


StraTeGy: To prove that f~! is continuous on V, it suffices to prove (apply 
Exercise 9.7.3 or Theorem 10.58 to f—!) that f(W) = (f-')—!(W) is open for all 
open W C V. Thus, given b € f(W), say b = f(a) for some a € W, we must find a 
p > Osuch that B,(b) C f(W). We will actually show more: that if B,(a) C W for 
some r > 0, then there is a p > 0 such that B,(b) C f(B;(a)); that is, y € B,(f(a)) 
implies that y = f(c) for some c € B, (a). 

Where should we look to find such a e? To show that f(c) — y = 0, we might 
first try finding a point c € B,(a) that minimizes ||f(x) — y|| as x ranges over B,.(a) 
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and then try showing that the minimum value is actually zero. It is relatively 
easy to prove that the minimum value is zero using the hypothesis that Ar # 0. 
Moreover, since ||f(x) — y|| is continuous on the compact set B; (a), the minimum 
value must be attained by some ec, but c might belong to the boundary of B,(a), 
not its interior. By controlling the size of o, we can keep the minimum value 
from occurring on the boundary. Here are the details. 


Proof. Suppose that W is an open subset of V and let b € f(W). Choose a € W 
such that b = f(a). Since W is open, choose 7p > 0 such that B,,(a) C W, and 
observe for any rr € (0, ro) that 0B,(a) C B,(a) C W. 

Since fis 1-1 on W, the real-valued function 


(x) := |lf(x) — f(a)|l, x € 0B,(a), 


is positive on the compact set 0B,(a). Since f is continuous on W, it follows 
from the Extreme Value Theorem that g attains a positive minimum on 
0B,(a); that is, 


m= inf sae? 


xe0B,(a 
Set p = m/2 and fix y € B,(f(a)). To show that y ¢€ f(B,(a)), notice that 
since the function h(x) := ||f(x) — y|| is continuous on the compact set B, (a), it 


also attains its minimum there. Thus there is ac € B,(a) such that h(c) < h(x) 
for all x € B,(a). 

To show that ¢ € B,(a), suppose to the contrary that ec ¢ B,(a); that is, that 
c € 0B,(a). Then ||f(c) — f(a)|| > m = 2p. Since y € B,(f(a)) implies that 
h(a) = ||f(a) — y|| < e, the minimum of ||/(x)|| must also be less that p; that is, 
p > h(c). Therefore, it follows from the triangle inequality that 


p > h(c) = ||f(©) — yll 2 If) — f(a)|| — f(a) — yll > 20 -p =~, 


a contradiction. Thus c € B,(a). 
It remains to prove that y = f(c). Notice that, since h(c) > 0, h7(c) is the 
minimum of h? on B,(a). Thus, by one-dimensional calculus, 


ah 
—(c) =0 
OX, 


fork =1,...,n. Since h?(x) = Via Fj) — yj)’, it follows that 


afi 


OXk 


1 an? : 
0=55,,0= 2 ® — ya. 


This is a system of n linear equations in n unknowns, f;(c) — y;. Since the 
matrix of coefficients of this system has determinant A¢(c) 4 0, it follows 
from Cramer’s Rule (see Appendix C) that this system has only the trivial 
solution; that is, f;(c) — yj = O for all j = 1,...,n. In particular,y =f(c). 
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Our second preliminary result shows that if the Jacobian of a continuously 
differentiable function f is nonzero at a point, then f must be 1-1 near that 
point. (This will provide a key step in the proof of the Inverse Function The- 
orem below.) 


11.40 Lemma. 
Let V be open in R" andf : V > R" be C! on V. If Ag(a) 4 0 for someae V, 
then there is anr > 0 such that B,(a) C V, fis I-1 on B,(a), Ag(x) 4 0 for all 
x € B,(a), and 


J 


Of; 
det | Z | #0 
Ox nxn 
for alley, ...e, € B,(a). 


STRATEGY: The idea behind the proof is simple. If f is not 1-1 on some B, (a), 
then there exist x, y € B,(a) such that x # y and f(x) = f(y). Since L(x; y) C 
B,(a), we have by Theorem 11.30 (the Mean Value Theorem) that 


n 


Ofi 
0= fily) — fi(x) = eon) (27) 


k=1 


for x = (x1,...,X%n), Y = (1,---, Wn), G € L(x y), andi = 1,...,n. Notice 
that (27) is a system of n linear equations in n unknowns, (yz, — xx). If we can 
show, for sufficiently small r, that the matrix of coefficients of (27) has nonzero 
determinant for any choice of ¢; € B,(a), then by Cramer’s Rule the linear 
system (27) has only one solution: yz — x, = 0 fork = 1,...,n. This would 
imply x = y, a contradiction. Here are the details. 


Proof. To show that there is anr > O such that the matrix of coefficients of the 
linear system (27) is nonzero for alle; € B,(a), let V™ = Vx---x V represent 
the n-fold Cartesian product of V with itself, and define h : V —> R by 


Ofi 
h(x], X2,..., Xn) = act] fi «x)| : 
Ox; nxn 


Since the determinant of a matrix is defined using products and differences 
of its entries (see Appendix C), we have by hypothesis that / is continuous 
on V™. Since h(a,...,a) = Ag(a) # 0, it follows that there is anr > 0 
such that B,(a) C V and h(cy,...,¢n) 4 0 fore; € B,(a). In particular, the 
matrix of coefficients of the linear system (27) is nonzero for all ¢; € B,(a), 
and A(x) = A(x,..., x) #0 for all x € B,(a). a 


We now prove a multidimensional version of the Inverse Function Theorem. 
11.41 Theorem. [THE INVERSE FUNCTION THEOREM]. 


Let V be open in R" andf : V > R" be C! on V. If Ag(a) 4 0 for someae V, 
then there exists an open set W containing a such that 
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i) fis 1-1 on W, 
ii) f-! isC! on f(W), and 
iii) foreach y € f(W), 
D(f')(y) = [DFE (y)), 


where [ |~! represents matrix inversion (see Theorem C.5). 


Proof. By Lemma 11.40, there is an open ball B centered at a such that f is 
1-1 and Ar 4 0 on B, and 


A:= det| OF 
0 


J 


| #0 


for all ¢; € B. Let Bo be an open ball centered at a which is smaller than B 
(i.e., the radius of Bo is strictly less than the radius of B). Then Bo C B, fis 
1-1 on Bo and, by Lemma 11.39, f~ ' is continuous on f(Bo). 

Let W be any open ball centered at a which is smaller than Bo. Then fis 1-1 
on W and f(W) is open. To show that the first partial derivatives of f~! exist 
and are continuous on f(W), fix yo € f(W) and 1 <i,k <n. Chooset € R \ {0} 
so small that yo + te, € f(W), and choose xo, x; € W such that x9 = f~!(yo) 
and x; = f~!(yo + te;). Observe that for each i = 1,2,...,7, 


_ 
fil!) — fil%) = i j iy 


Hence, by Theorem 11.30 (the Mean Value Theorem), there exist points ¢; € 
L(xo; X1) such that 


ey Xia Xo _ fii) — fio) Jl kai —- 
V fi(ci) A oF : {5 bei FS 1,2 )c3i $8) 


Let xe ) (respectively, x0 )) denote the jth component of xo (respectively, x;). 


Since (28) is a system of n linear equations in n variables ese vb x ) /t whose 
coefficient matrix has determinant A (which is nonzero by the choice of B), 
we see by Cramer’s Rule that the solutions of (28) satisfy 


(f)j(yo + tex) — EY j(y0) xy? — x6? 
t a t 


= Q;(t), (29) 


where Q(t) is a quotient of determinants whose entries are Os or 1s, or first- 
order partial derivatives of components of f evaluated at the ¢;’s. Since t > 0 
implies xj > xo, ¢; > xo, and yo + tex — yo, it follows that Qj (t) converges 
to Q;, a quotient of determinants whose entries are Os or 1s, or first-order 
partial derivatives of components of f evaluated at xg = f~ '(yo). Since f~! is 
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continuous on f(W), Q; must be continuous at each yo € f(W). Taking the 
limit of (29) as t + 0, we see that the first-order partial derivatives of (f~!); 


exist at yo and equal Q;; that is, f~ lis continuously differentiable on f(W). 
It remains to verify iii). Fix y € f(W), and observe, by the Chain Rule and 
Exercise 11.2.8, that 


I = DI(y) = Df of ')(y) = DEE '(y)) DE '(y). 
By the uniqueness of matrix inverses, we conclude that 
De ')(y) = [Dt "(y)). 7 
Of course, the value Df~'(y) is not unique because f~! may have several 
branches. For example, if f(x) = x*, then f—'(1) = 41, depending on whether 
we take the inverse of f(x) near x = 1orx =—1. 


11.42 Remark. The hypothesis “Ag 4 0” in Lemma 11.39 can be relaxed. 


Proof. If f(x) = x3, then f : R > R and its inverse f~!(x) = %/x are 
continuous on R, but A /(0) = f’(0) = 0. | 


11.43 Remark. The hypothesis “Ag #4 0” in Theorem 11.41 cannot be relaxed. 
In fact, if f : B,(a) — R" is differentiable at a and its inverse f—! exists and is 
differentiable at f(a), then Ag(a) 4 0. 


Proof. Suppose to the contrary that fis differentiable at a but A¢(a) = 0. By 
Exercise 11.2.8 and the Chain Rule, 


I = Dif! of)(a) = Dif!) (f(a) D(a). 
Taking the determinant of this identity, we have 
1 = Arg-i (f(a)) Ar(a) = 0, 
a contradiction. a 


11.44 Remark. The hypothesis “f is C! on V” in Theorem 11.41 cannot be 
relaxed, even when f is a real function. 


Proof. If f(x) = x + 2x? sin(1/x), x #0, and f(0) = 0, then f : R > Ris 
differentiable on V := (—1, 1) and f’(0) = 1 4 0. However, since 


Carlee ace 
ING ta) PGE Das) Vee 


fork € N, f is not 1-1 on any open set which contains 0. Therefore, no open 
subset of f(V) can be chosen on which f~! exists. | 
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Although Theorem 11.41 says f must be 1-1 on some subset W of V, it does 
not say that fis 1-1 on V. 


11.45 Remark. The set W chosen in Theorem 11.41 is, in general, a proper subset 
of V, even when V is connected. 


Proof. Set f(x, y) = (x?—y7, xy) and V = R*\{(0, 0)}. Then Ag = 2(x?+y’) # 
0 for (x, y) € V, but f(x, —y) = f(—x, y) for all (x, y) € R*. Thus f is not 1-1 
on V. | 
Sometimes vector functions from p variables to n variables are defined 
implicitly by relations on R’*”. On rare occasions, such a relation can be solved 


explicitly as follows. 


*11.46 EXAMPLE. 


If 5 + 55 + is = land xo # 0, prove that there exist an r > 0 and a function 
g(s,t), continuously differentiable on B,(so, to), such that xo = g(so, to) and 


r4+s4rP=1 
for x = g(s,t) and (s, t) € B,(so, to). 
Proof. Solve x” +s? +t? = 1 for x to obtain 
x =+VJV1—s2—-1??. 


Which sign shall we take? If x9 > 0, set g(s,t) = V1—s?2—1f?. By the 
Chain Rule, 


OA > —s an Og —t 
ds JI —-s 2 a VI—s— 


Thus g is differentiable at any point (s,t) which lies inside the two- 
dimensional unit ball (i.e., which satisfies 5? + 1° < 1). Since x3 + 56 +16 =1 
and x9 > 0, (so, fo, x9) lies on the boundary of the three-dimensional unit ball 
in stx space a distance xp units above the st plane (see Figure 11.5). In partic- 


ular, ifr := 1—,/1 —x¢ and (s, t) € B,(so, to), then s* +7? < 1. Therefore, g is 
continuously differentiable on B,(so, to). If xo < 0, a similar argument works 


for g(s,t) = —V1—s* —??. a 


We cannot expect that all relations can be solved explicitly as we did in 
Example 11.46 above. It is most fortunate, therefore, that once we know a solu- 
tion exists, we can often approximate that solution by numerical methods. 

The crux of the matter, then, is which relations have solutions? In order to 
state a result about the existence of solutions to a relation, we introduce addi- 
tional notation. Let V be an open subset of R”, f: V > R”, anda € V. Then 
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FIGURE 11.5 


the partial Jacobian of f generated by a subset {k1, k2,..., kn} of {1,2,..., m} at 
the point a is the number 


a ) 
Aa) os of (a) 
B( Fins ++++ Fin) (ay := det BE «@)| =det|  : : 
0(x1, eis Xn) Ox] nxn afk, a. 
a . (a) 
x1 IXn 


provided all these partial derivatives exist. For the case n = m, the correspond- 
ing partial Jacobian is just the Jacobian Af(a). We shall use partial Jacobians 
again in Chapter 12 to discuss change of variables for integrals in R”, and in 
Chapter 13 to introduce differential forms of order 2. 

For the next several pages, we shall represent a vector (x1,...,Xn,t1,--.,tp) 
in R"*? by (x,t). Here is a result about the existence of solutions to relations. 
It states as follows: If F is a C! function which satisfies F(xo, to) = 0 at some 
point (xoto), then the implicit relation F(x, t) = 0 can be solved for the variables 
X1,...,X,, when t is near tg, provided the partial Jacobian of F with respect to 
X1,X2,...,Xy (the variables we want to solve for) is not zero at (xo, tg). 


11.47 Theorem. [THE IMPLICIT FUNCTION THEOREM]. 

Suppose that V is open in R"*?, and that F = (F\,..., Fn) : V > R" is C! on 
V. Suppose further that F(xo, to) = 0 for some (Xo, to) € V, where xo € R" and 
to € R?. If 


O(Fi,..., Fr) 


Gio (Xo, to) £0, 
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then there is an open set W C RP containing to and a unique continuously 
differentiable function g : W — R" such that g(to) = xo, and F(g(t), t) = 0 for 
allt € W. 


StratTecy: The idea behind the proof is simple. If F took its range in R”*? 
instead of R” and had nonzero Jacobian, then, by the Inverse Function Theorem, 
F—! would exist and be differentiable on some open set. Presumably, the first n 
components of F—! would solve F for the variables X1,.-+, X,. Thus we should 
extend F (in the simplest possible way) to a function F which takes its range in 
R"*? and has nonzero Jacobian, and apply the Inverse Function Theorem to F. 
Here are the details. 


Proof. For each (x, t) € V, set 
F(x, t) = (Fi(x,t),..., Fr(&®),t1,..., tp). (30) 


Clearly, F: V > R"*+? and 


OF; 
DF = Ee # ’ 
Opxn I pxp 


where Op. represents a zero matrix, J.) represents an identity matrix, and 
B represents a certain n x p matrix whose entries are first-order partial deriva- 
tives of F;’s with respect to &%’s. Expanding the determinant of DF along the 
bottom rows first, we see by hypothesis that 


Ag(xo, to) = 1- OCs Bn) oe to) # 0 
re Oise s 


Since F (xo, to) = (0, to), it follows from the Inverse Function Theorem that 
there exist open sets Q; containing (Xo, to) and Q2 := F(Q)) containing (0, to) 
such that F is 1-1 on Q), and G := F~! is 1-1 and continuously differentiable 
on Qo. o 

Let @ = (G1,..., Gn). Since G = F~! is 1-1 from Q2 onto Qy, it is evident 
by (30) that 


(F(x, t)) =x (31) 
for all (x, t) € Q; and 
F(@(x, t), t) = (x, 8 (32) 


for all (x, t) € Qo. Define g on W := {t € R? : (0,t) € Q2} by git) = A(0, t). 
Since Q2 is open in R"t?, W is open in R?”. Since G is continuously differen- 
tiable on Q2 and ¢ represents the first n components of G, g is continuously 
differentiable on W. By the definition of g, the choice of xo, and (31), we have 


g(to) = (0, to) = o(F (xo, to)) = xo. 
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Moreover, by (30) and (32) we have F(@(x, t), t) = x for all (x, t) € Qo. Spe- 
cializing to the case x = 0, we obtain F(g(t), t) = 0 fort € W. 

It remains to show uniqueness. But ifh : W — R" satisfies F(h(t), t) = 0 = 
F(g(t), t) [ie., F(h(t), t) = (0, t) = F(g(t), t)], then g(t) = h(t) for all t € W, 
since F is 1-1 on Q2. |_| 


Theorem 11.47 is an existence theorem. It states that a solution g exists with- 
out giving us any idea how to find it. Fortunately, for many applications it is not 
as important to be able to write an explicit formula for g as it is to know that g 
exists. 

Here is an example for which an explicit solution is unobtainable. 


11.48 EXAMPLE. 


Prove that there is a function g(s,f), continuously differentiable on some 
B,(1, 0), such that 1 = g(1, 0), and 


sx? + tx3 +2 /t+s- t?x4 — x5 cost —x® = 1 


for x = g(s,t) and (s, t) € B-(, 0). 


Proof. If F(x,s,t) = sx? t+ tx? + 2/t+s5 + t2x4* — x cost — x® — 1, then 
F(1, 1,0) = 0, and F, = 2sx + 3tx? + 4t?x3 — 5x+cost — 6x? is nonzero at 
the point (1, 1, 0). Applying the Implicit Function Theorem to F, with n = 1, 
p =2, xo = 1, and (so, to) = (1, 0), we conclude that such a g exists. a 


Even when an explicit solution is obtainable, it is frequently easier to apply 
the Implicit Function Theorem than it is to solve a relation explicitly for one or 
more of its variables. Indeed, consider Example 11.46 again. Let F(x, s,t) = 1-— 
x? — s? — ?? and notice that Fy = —2x. Thus, by the Implicit Function Theorem, 
a continuously differentiable solution x = g(s, t) exists for each xo 4 0. 

The following example shows that the Implicit Function Theorem can be 
used to prove that differentiable solutions to a system of equations exist 


simultaneously. 


11.49 EXAMPLE. 


Prove that there exist functions u,v : R* — R, continuously differentiable 
on some ball B centered at the point (x, y,z,w) = (2,1,—1, —2), such that 
u(2, 1, -1, —2) =4, v(2, 1, —1, —2) = 3, and the equations 
urov w 
wtutuwr=29, S+5+5=17 
x z 


i) 
i) 


<< 


both hold for all (x, y, z, w) in B. 
Proof. Setn =2, p =4, and 


Flu, v,x,y,Z,w) = (u> fap* fy? = 99; Tae bead + v?/y* a. w/z? —17). 
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Then F(4, 3, 2, 1, —1, —2) = (0, 0), and 


OU ER): wie pu ca = 4duv oes 
a(u,v) «| 2u/x? 2v/y? | ye x2)" 


This determinant is nonzero when u = 4, v = 3, x = 2, and y = 1. Therefore, 
such functions u, v exist by the Implicit Function Theorem. ie) 


EXERCISES 


11.6.1. For each of the following functions, prove that f~! exists and is differ- 
entiable in some nonempty, open set containing (a, b), and compute 
D(f!)(a, b) 

a) f(u,v) = (u +0, 3u — 2v) at any (a, b) € R* 

b) f(u, v) = (u — v, cosu + sin v) at (a, b) = (0, 1) 
c) f(u, v) = (2uv, u* + v’) at (a, b) = (12, 13) 

d) f(u, v) = (ue“t”, log v — cos u) at (a, b) = (0, —1) 

11.6.2. For each of the following functions, find out whether the given expres- 
sion can be solved for z in a nonempty, open set V containing (0, 0, 0). 
Is the solution differentiable near (0, 0)? 

a) xy +z—cos(x ty +z) +1=0 

b) x+y? +23 + Jcos(x? + y2) +42 +8) =3 

c) x(2sin y — 3y sinz) + (zsinx — cosz) =0 

d) x7 + y? +27 4+ g(x, y,z) = 0, where g is any continuously differen- 
tiable function which satisfies g(0, 0,0) = 0 and g,(0, 0, 0) > 0 

11.6.3. Prove that there exist functions u(x, y), v(x, y), and w(x, y), and an 
r > 0 such that u,v,w are continuously differentiable and satisfy the 
equations 


xu? +yv?+we+w=0 
v—w+ytw=0 


ay aa S41 


on B,(1, 1), andu(1, 1) =1, v0, 1) =1, w0,1) =-1. 

11.6.4. Find conditions on a point (xo, yo, uo, vo) such that there exist real- 
valued functions u(x, y) and v(x, y) which are continuously differen- 
tiable near (xo, yo) and satisfy the simultaneous equations 


xu? + yv* — 3xy =5 
xv? + yur +3xy = 11. 


Prove that the solutions satisfy u? + v* = 16/(x + y). 
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11.6.5. 


11.6.6. 


11.6.7. 


11.6.8. 


11.6.9 |. 


Differentiability on R” 


Given nonzero numbers xo, yo, “uo, vo, 50, f9 Which satisfy the simulta- 
neous equations 


(*) uw +x? +y* =0 
vittx+tsy =0 
3xs2 + 2yt? =2 
(xs? — yt?)/2=0, 


Prove that there exist functions u(x, y), v(x, y), s(x, y), t(x, y), and 
an open ball B containing (x0, yo), such that u,v,s,t are continu- 
ously differentiable and satisfy (*) on B, and such that u(xo, yo) = uo, 
v(x0, Yo) = vo, 5(X0, Yo) = $0, t(x0, Yo) = fo. 

Let EF = {(x, y):0 < y < x} and set f(x, y) = (x+y, xy) for (x, y) € E. 


a) Prove that f is 1-1 from E onto {(s,t) : s > 2,/t,t > 0} and find a 
formula for f—!(s, f). 

b) Use the Inverse Function Theorem to compute D(f~ '\@(x, y)) for 
(x,y) EE. 

c) Use the formula you obtained in part a) to compute D(f~!)(s, t) di- 
rectly. Check to see that this answer agrees with the one you found 
in part b). 

Suppose that V is open in R”, that a € V, and that F : V > RisC! 

on. V; Ht F@)=0 7-F,,@)-andu? 3= Gi, 5.68 Fpy 4 Fn) 

for j = 1,2,...,n, prove that there exist open sets W; containing 

(a\,.-.,@j—-1,4j+1,.-.,4n), anr > O, and functions g(a), C! on 

Wj, such that F(x1,..., Xj-1, gj), xj41, ...,Xn) =Oon Wj and 


081 082 983 98n 
OXn OX] OX2 OXn—1 


= (-1)" 


on B,(a). 
Suppose that f : R* — R? has continuous first-order partial derivatives 
in some ball B, (x9, yo), r > 0. Prove that if A¢(xo, yo) 4 0, then 


af, | _ df2/dy(x0, yo) AF, | _ —4f1/dy(x0, yo) 
a (f 0, yo) = MG ae (f 0, yo) = =a 

and 

Of, _ =df2/8x(X0, yo) fz | __ af \/Ax(x0. Yo) 
e (f 0, yo)) = Gig ae (f 0, Yo)) = Gaaer * 


This exercise is used in Section *11.7. Let F : R’ > R be continuously 
differentiable in some open set containing (a, b,c) with F(a, b,c) = 0 
and VF (a, b,c) #0. 
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a) Prove that the graph of the relation F(x, y, z) = 0; that is, that the 
set G := {(x, y, z) : F(x, y, z) = 0} has a tangent plane at (a, b,c). 

b) Prove that a normal of the tangent plane to G at (a, b, c) is given by 
VF (a, b,c). 


11.6.10. Suppose that f := (u,v): R> R? is C? and that (xo, yo) = f(to). 


a) Prove that if f(t) 4 0, then w’(to) and v'(to) cannot both be zero. 

b) If f(t) 4 0, show that either there is a C! function g such that 
g(xo) = tf and u(g(x)) = x for x near xo, or there is a C! function h 
such that h(yo) = fo and v(h(y)) = y for y near yo. 


11.6.11. Let H be the hyperboloid of one sheet, given by x? + y* — 27 = 1. 


a) Use Exercise 11.6.9 to prove that at every point (a, b,c) € H, 1 has 
a tangent plane whose normal is given by (—a, —b, c). 

b) Find an equation of each plane tangent to 1 which is perpendicular 
to the xy-plane. 

c) Find an equation of each plane tangent to 1 which is parallel to the 
planex+y—z=l1. 


*11.7 OPTIMIZATION 


This section uses no material from any other enrichment section. 
In this section we discuss how to find extreme values of differentiable func- 
tions of several variables. 


11.50 Definition. 
Let V be open in R”, let a € V, and suppose that f : V > R. 


i) f(a) is called a local minimum of f if and only if there is an r > 0 such 
that f(a) < f(x) for all x € B,(a). 
ii) f(a) is called a local maximum of f if and only if there is an r > 0 such 
that f(a) > f(x) for all x € B,(a). 
iti) f(a) is called a local extremum of f if and only if f(a) is a local maximum 
or a local minimum of f. 


The following result shows that, as in the one-dimensional case, extrema of 
real-valued differentiable functions occur among points where the “derivative” 
iS Zero. 


11.51 Remark. /f the first-order partial derivatives of f exist at a, and f(a) is a 
local extremum of f, then V f(a) = 9. 


Proof. The one-dimensional function g(t) = f(a),...,@j—1,t, @j41,---,4n) 
has a local extremum at ¢ = a; for each j = 1,...,n. Hence, by the one- 
dimensional theory, 
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ae 
jy 8) = 8a) = 0. = 


As in the one-dimensional case, V f(a) = 0 is necessary but not sufficient for 
f (a) to be a local extremum. 


11.52 Remark. There exist continuously differentiable functions which satisfy 
V f(a) = 0 such that f (a) is neither a local maximum nor a local minimum. 


Proof. Consider 


f(x,y) =y? — x’. 


Since the first-order partial derivatives of f exist and are continuous every- 
where on R?, f is continuously differentiable on R2. Moreover, it is evident 
that V f(0) = 0, but (0) is not a local extremum (see Figure 11.6). | 


FIGURE 11.6 


The fact that the graph of this function resembles a saddle motivates the fol- 
lowing terminology. 


11.53 Definition. 


Let V be open in R", leta € V, and let f : V > R be differentiable at a. Then 
ais called a saddle point of f if V f(a) = 0 and there is a ro > 0 such that given 
any 0 < p < ro there are points x, y € B,(a) which satisfy f(x) < f(a) < f(y). 


By the Extreme Value Theorem, if f is continuous on a compact set H, then 
it attains its maximum and minimum on H; that is, there exist points a,b ¢ H 
such that 
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f(a) = sup f(x) and f(b) = inf f(x). 
xeH xeH 


When f is a function of two variables, these points can sometimes be found by 
combining Remark 11.51 with one-dimensional techniques. 


11.54 EXAMPLE. 


Find the maximum and minimum of f(x, y) = Cee ee i —2y on H = B(0, 0). 


Solution. If V f(x, y) = (0,0), then (x, y) = (1/2, 1). Since this point belongs 
to H, it might be an extremum of f on H. Set it aside. (If it did not belong to 
H, we would discard it.) 

By polar coordinates, 0H = {(x, y) = (2cos6,2sin@) : 6 € [0,27]} is es- 
sentially a one-dimensional set. Thus to find the extrema of f on 0H, we must 
optimize h(@) := f(2cos@,2sin@) = 2(2 — cos@ — 2sin@) on [0, 27]. Since 
h'(@) = O implies tan@ = 2, the critical points of h are 6 = arctan2 ~ 1.10715 
and @ = arctan2 + 7 © 4.24874. This provides two more candidates for extrema 
of fon H: (x, y) © (0.4472, 0.8944) and (x, y) © (—0.4472, —0.8944). Finally, 
the endpoints of [0, 277] provide a fourth candidate: (x, y) = (2, 0). 

Evaluating f at these four points, we see that the maximum of f on H is 
f (—0.4472, —0.8944) ~ 3.236 and the minimum of f on H is f(1/2, 1) = —1.25. 
[The values f (0.4472, 0.8944) + —1.236 and f(2, 0) = 2 are neither maxima nor 
minima of f on H.] | 


Using the second-order total differential D™ f introduced in Section 11.5, we 
can obtain a multidimensional analogue of the Second Derivative Test. First, we 
prove a technical result. 


11.55 Lemma. 
Let V be open in R", a € V, and f : V > R. [f all second-order partial 
derivatives of f exist at a and D™ f(a;h) > 0 for allh 4 0, then there is an 
m > 0 such that 


D® f (a; x) > ml|x||? (33) 


forallx € R". 


Proof. Set H = {x € R" : ||x|| = 1} and consider the function 


non 9 
g(x) i= D® f(a: x) = of (a) xjXk, x ER”. 


Xx Xj 
get” & Ox; 


By hypothesis, g is continuous and positive on R”\{0} and, hence, on H. Since 
H is compact, it follows from the Extreme Value Theorem that g has a positive 
minimum m on H. 
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Clearly, (33) holds for x = 0. If x 4 0, then x/||x|| € H, and it follows from 
the choice of g and m that 


D® fa;x) =o xi? = 2 (=) IIx|I? > mIlx|?. 


IIxll? IIx'l 
We conclude that (33) holds for all x € R”. a 


11.56 Theorem. [THE SECOND DERIVATIVE TEST]. 

Let V be open in R", a € V, and suppose that f : V > R satisfies V f (a) = 0. 
Suppose further that the second-order total differential of f exists on V and is 
continuous at a. 


i) If D® f(a;h) > 0 for allh ¢ 0, then f (a) is a local minimum of f. 
ii) If D™ f(a;h) < 0 for allh ¥ 0, then f (a) is a local maximum of f. 
iii) If D™ f (a; h) takes on both positive and negative values for h € R", thena 
is a saddle point of f. 


Proof. Choose r > 0 such that B,(a) C V, and suppose for a moment that 
there is a function ¢ : B-(0) — R such that e(h) — 0 ash > 0 and 


Fath) — f(a) = dD f(a; by + IIe) (34) 


for ||h|| sufficiently small. If D© f(a;h) > 0 for h ¥ 0, then (33) and (34) 
imply 


fat+h)— f(@) > (F +e) II? 


for ||h|| sufficiently small. Since m > 0 and e(h) > O ash — 9, it follows that 
f(ath) — f(a) > 0 for ||h|| sufficiently small; that is, f(a) is a local minimum. 
Similarly, if D” f(ash) < 0 forh 4 0, then f(a) is a local maximum. This 
proves parts i) and ii). 

To prove part iii), fix h € R” and notice that (34) implies 


f(at+th — f@) =" (50° ra h) + ihc) ) 


fort € R. Since e(th) > O ast — 0, it follows that f(a + th) — f(a) takes on 
the same sign as D® f(a; h) for t small. In particular, if D® f(a; h) takes on 
both positive and negative values as h varies, then a is a saddle point. 

It remains to find a function e : B,(0) — R such that c(h) > Oash — 0, 
and (34) holds for all ||h|| sufficiently small. Set (0) = 0 and 


f(a+h) — f(a) — 5D f(a; h) 


h) = 
ve ie 


he B,(0), h40. 
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By the definition of e(h), (34) holds for h € B,(0). Does e(h) > 0 ash > 0? 
Fix h = (Aj, /2,..., hy) € B-(0). Since V f(a) = 0, Taylor’s Formula implies 


1 
fat+h)— fa) =5D° feb) 
for some ec € L(a;a+h); that is, 


fla+h) — f(a) - 5D? fa: h) = ; (D® fe h) — D® f(a: h)) 


n n 


1 af a f 
o De & OX (c) Ox; OX «@)) Nahe 


j=lk=l 


Since |hjhx| < \|h||? and the second-order partial derivatives of f are contin- 
uous at a, it follows that 


2 2 
od, (c) oui @) > 0 


OX; OXK OX; OXK 


1 
0 < let) < 5 es 


as h — 0. We conclude by the Squeeze Theorem that c(h) > Oash—> 0. Hf 


The following result shows that the strict inequalities in Theorem 11.56 cannot 
be relaxed. 


11.57 Remark. /f D” f(a; h) > 0, then f(a) can be a local minimum or a can 
be a saddle point. 


Proof. f (0,0) is a local minimum of f(x, y) = x4+ + y?, and (0, 0) is a saddle 
point of f(x, y) =x?4 y?. | 


In practice, it is not easy to determine the sign of D@ f(a; h). For the case 
n = 2, the second total differential D® f(a; h) is a quadratic form (i.e., has the 
form Ah? + 2Bhk + Ck’). The following result shows that the sign of a quadratic 
form is determined completely by the discriminant D = AC — B?. 


11.58 Lemma. 
Let A,B,C €R, D = AC — B’, and o(h, k) = Ah? + 2Bhk + Ck’. 


i) If D > 0, then A and $(h, k) have the same sign for all (h, k) 4 (0, 0). 
ii) If D < 0, then $(h, k) takes on both positive and negative values as (h,k) 
varies over R*. 


Proof. 
i) Suppose that D > 0. Then A 4 0 and A@¢(h, k) is a sum of two squares: 


Ag(h, k) = A*h? + 2ABhk + ACk* = (Ah + Bk)? + Dk’. 
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Since A 4 0 # D, at least one of these squares is positive for each (h, k) 4 
(0, 0). It follows that A and ¢(h, k) have the same sign for all (h, k) 4 (0, 0). 
ii) Suppose that D < 0. Then either A 4 0 or B £0. 
If A 4 0, then A@g(h, k) is a difference of two squares: 


Ad(h, k) = (Ah + Bk —./|D\|k)\(Ah + Bk + V/|DI|k). 


The lines Ah + Bk —./|D|k =O and Ah+ Bk +./|D|k = 0 divide the hk-plane 
into four open regions (see Figure 11.7). Since A¢(h, k) is positive on two of 
these regions and negative on the other two, it follows that ¢(h, k) takes on 
both positive and negative values as (h, k) varies over R?. 


‘i 7, Ah + Bk — VDk 
/ 
/ 
/ 
/ 
/ 
/ 
/ 
/ = 
II / I a 
FA = An + Bk +-VDk 
ae 
ae = = 
eo A h 
sen / 
im , IV. 
/ 
/ 
/ 
FIGURE 11.7 


If A= Oand B #0, then 
o(h, k) = 2Bhk + Ck? = (2Bh+ Ckk. 


Since B # 0, the lines 2Bh + Ck = 0 and k = 0 divide the hk-plane into four 
open regions. As before, ¢(h, k) takes on both positive and negative values as 
(h, k) varies over R?. Ho 


This result leads us to the following simple test for extrema and saddle points. 


11.59 Theorem. Let V be open in R’, (a,b) € V, and suppose that f: V >R 
satisfies V f (a, b) = 0. Suppose further that the second-order total differential of 
f exists on V and is continuous at (a, b), and set 


D = fix(a, b) fyy(a, b) — fzy(@, b). 
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i) If D > Oand fyx(a, b) > 0, then f(a, b) is a local minimum. 
ii) If D > Oand f,x(a,b) < 0, then f(a, b) is a local maximum. 
iti) If D <0, then (a, b) is a saddle point. 


Proof. Set A = fxx(a,b), B = fxy(a,b), and C = fyy(a,b). Apply Theo- 
rem 11.56 and Lemma 11.58. | 


(For a discriminant which works for functions on R", see Colley [3], p. 250.) 


11.60 Remark. /f the discriminant D = 0, f(a, b) may be a local maximum, a 
local minimum, or (a, b) may be a saddle point. 


Proof. The function f(x, y) = x? has zero discriminant at (a, b) = (0, 0), and 
0 = f(0,0) is a local minimum for f. On the other hand, f(x, y) = x? has 
zero discriminant at (a, b) = (0, 0), and (0, 0) is a saddle point for f. | 


In practice, we often wish to optimize a function subject to certain constraints. 
(For example, we do not simply want to build the cheapest shipping container, 
but the cheapest shipping container which will fit in a standard railway car and 
will not fall apart after several trips.) 


11.61 Definition. 
Let V be open in R”, ae¢ V,and f,g;: V > Rfor j =1,2,...,m. 


i) f(a) is called a local minimum of f subject to the constraints g;(a) =0, j = 
1,...,m, if and only if there is a p > 0 such that x € B,(a) and gj(x) = 0 
for all j =1,...,m imply f(x) > f(a). 

ii) f(a) is called a local maximum of f subject to the constraints gj(a) =0, j = 
1,...,m, if and only if there is a p > 0 such that x € B,(a) and g;(x) = 0 
for all j =1,...,m imply f(x) < f(a). 


The following example serves two purposes: to illustrate Definition 11.61 and 
to motivate Theorem 11.63. 


*11.62 EXAMPLE. 


Find all points on the ellipsoid x? + 2y* + 3z* = 1 (see Appendix D) which lie 
closest to or farthest from the origin. 


Solution. We must optimize the distance formula ,/x? + y? + z?; equivalently, 
we must optimize the function f(x, y, z) = x7 + y* +z? subject to the constraint 
g(x, y,zZ) =x? +2y? +327 — 1 = 0. Using g to eliminate the variable x in f, we 
see that f takes on the form 


o(y,z) =1—y? — 22°. 
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Solving V@(y, z) = (0, 0), we obtain (y, z) = (0,0) (ie., x? = 1). Thus, elim- 
ination of x leads to the points (+1, 0,0). Similarly, elimination of y leads to 
(0, +1//2, 0), and elimination of z leads to (0, 0, £1/V/3). Checking the dis- 
tance formula, we see that the maximum distance is 1, which occurs at the 
points (+1, 0, 0), and the minimum distance is 1/./3, which occurs at the points 
(0,0, £1 /V3). (The points (0, +1 /V2, 0) are saddle points, that is, correspond 
neither to a maximum nor to a minimum.) | 


FIGURE 11.8 


Optimizing a function subject to constraints, as above, by eliminating one or 
more of the variables is called the direct method. There is another, more geomet- 
ric, method for solving Example 11.62. Notice that the points on the ellipsoid 
g(x, y, z) = x* +2y* +3z7 —1 = 0 which are closest to and farthest from the ori- 
gin occur at points where the tangent planes of the ellipsoid g(x, y, z) = 0 and 
the sphere f(x, y, z) = 1 are parallel (see Figure 11.8). Recall that two nonzero 
vectors a and b are parallel if and only if a+ Ab = 0 for some scalar 24 4 0. 
Since normal vectors of the tangent planes of f(x, y,z) = 1 and g(x, y,z) = 0 
are V f and Vg (see Exercise 11.6.9b), it follows that extremal points (x, y, z) of 
f(x, y, z) subject to the constraints g(x, y, z) = 0 must satisfy 


Vi (x, y, 2) +AVa(x, y, 2) =0 (35) 
for some A # 0. For the case at hand, (35) implies (2x, 2y, 2z) + A(2x, 4y, 6z) = 


(0, 0,0). Combining this equation with the constraint g(x, y,z) = 0, we have 
four equations in four unknowns: 


e041), y(2A + 1) =0, z3A+1)=0, and x7+2y*4+3=1. 
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Solving these equations, we obtain three pairs of solutions: (+1, 0,0) (when 
4 = —1), (0, 1/V2, 0) (when 4 = —1/2), and (0, 0, £1//3) (when 4 = —1/3). 
Hence, we obtain the same solutions with the geometric method as we did with 
the direct method. 

The following result shows that the geometric method is valid when the func- 
tions have nothing to do with spheres and ellipsoids, even when several con- 
straints are used. This is fortunate since the direct method cannot be used unless 
the constraints are relatively simple. 


11.63 Theorem. [LAGRANGE MULTIPLIERS]. 
Letm <n, V be open in R", and f, gj : V > R be C! on V for j =1,2,...,m. 
Suppose that there is ana € V such that 


0(g1,---+5 &m) 
O(X1,-.-, Xm) 


(a) #0. 


If f(a) is a local extremum of f subject to the constraints gx(a) = 0, k = 


1,...,m, then there exist scalars d1, 2, ..., Am such that 
m 
Vi (a) + >> AcVgx(a) = 0. (36) 
k=1 


Proof. Equation (36) is a system of n equations in m unknowns, 
A1,A2,-.+,Am: 


m 
r) 0 

y ip aye = a, JH, 2y ee, 70: (37) 

a oo ons 


The first m of these equations forms a system of m linear equations in m vari- 
ables whose matrix of coefficients has a nonzero determinant and, hence, this 
system uniquely determines the A;’s. What remains to be seen is that because 
f(a) is a local extremum subject to the constraints g;,(a) = 0, these same 
Ax’s also satisfy (37) for j = m+ 1,...,n. This is a question about implicit 
functions. 

Let p = n-—~m. As in the proof of the Implicit Function Theorem, write 
vectors in R™*? in the form x = (y,t) = (y1,..., ym. h,---s tp). We must 
show that 


_ of wy 88k 
0=7-@4 2 a (38) 


for£=1,...,p. 
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Let g = (g1,..., 8m), and choose yo € R”, to € R” such that a = (yo, to). 
By hypothesis, g(yo, to) = 0 and the partial Jacobian of g (with respect to the 
variables y;) is nonzero at (yo, to). Hence, by the Implicit Function Theorem, 
there is an open set W Cc R? which contains tg, and a function h : W > R” 
such that h is continuously differentiable on W, h(to) = yo, and 


g(h(t), t) = 0, te W. (39) 
For eacht € W andk = 1,...,m, set 
Gx(t) = gx (h(t),t) and F(t) = f(h(t), t). 


We shall use the functions Gj,..., Gj», and F to verify (38) for 2 = 1,..., p 
Fix such an é. By (39), each G, is identically zero on W and, hence, has deriva- 
tive zero there. Since to € W and (h(to), to) = (yo, to) = a, it follows from the 
Chain Rule that 


ah, oh, 
— (t Sake Bp 
an to (to) 


gk ase 9] | Mt). Shag ) 
O = DGx(to) = [sew ve a a 
x] 
1 ais 0 
ee oe 
Hence, the éth component of DG; (to) is given by 


a) 0 
0= yo ota to) + Ea (40) 
j=l 


fork = 1,2,...,m. Multiplying (40) by A, and adding, we obtain 


= 0= ato Tito )+ Yai 


k=1 j= 


1 
a (Syd) & — (to) + a cea) 
ah 


j=l \k=1 


Hence, it follows from (37) that 


0=-yo of a it) + Dag. (41) 
mA 
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Suppose that f(a) is a local maximum subject to the constraints g(a) = 0. 
Set Eo = {x € V : g(x) = 0}, and choose an n-dimensional open ball B(a) 
such that 


x€ Bia)N Eo implies f(x) < f(a). (42) 
Since h is continuous, choose a p-dimensional open ball B(to) such that t € 


B(to) implies (h(t), t) € B(a). By (42), F(to) is a local maximum of F on B(tg). 
Hence, V F'(to) = 0. Applying the Chain Rule as above, we obtain 


0 dh; a 
0= wy) +a (43) 


[compare with (40)]. Adding (43) and (41), we conclude that 


af 98k 
0= r ; a 
a yok a 
k=l 
11.64 EXAMPLE. 
Find all extrema of x?+y?+z7 subject to the constraints x—y = 1 and y?—z? = 1. 


Solution. Let f(x, y,z) = Pogo > + 27, g(x,y,z)=x—-—y-—Ilandh(x, y,z) = 
y? — z* — 1. Then (36) takes on the form V f +AVg + uVA = 0; that is, 


(2x, 2y, 2z) + AC, —1, 0) + (0, 2y, —2z) = (0, 0, 0). 


In particular, 2x +4 = 0, 2y + 2uy — A = 0, and 2z — 2uz = 0. From this last 
equation, either wu = 1 or z = 0. 
If ~ = 1, then A = 4y. Since 2x + A = 0, we find that x = —2y. From g = 0 


we obtain —3y = 1 (ie., y = —1/3). Substituting this into h = 0, we obtain 
z? = —8/9, a contradiction. 

If z = 0, then from h = 0 we obtain y = +1. Since g = 0, we obtain x = 2 
when y = 1, and x = 0 when y = —1. Thus, the only candidates for extrema of 


f subject to the constraints g = 0 =A are f(2,1,0) = 5 and f(0, —1,0) = 1. 
To decide whether these are maxima, minima, or neither, look at the problem 
from a geometric point of view. The problem requires us to find points on the 
intersection of the plane x — y = 1 and the hyperbolic cylinder y? — z? = 1 
which lie closest to the origin. Evidently, both of these points correspond to 
local minima, and there is no maximum (see Figure 11.9). In particular, the 
minimum of x* + y* + 2? subject to the given constraints is 1, attained at the 
point (0, —1, 0). | 
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FIGURE 11.9 


EXERCISES 


11.7.1. 


11.7.2. 


11.7.3. 


11.7.4. 


Find all local extrema of each of the following functions. 


a) f@,y) =x? —xyt+y?—y 

b) f(x, y) =cosx +cosy 

c) f(x, y,z) =e sinz 

d) f(x, y) = ax? + bxy + cy*, where a 4 0 and b? — 4ac £0 

For each of the following, find the maximum and minimum of f on H. 


a) f(x, y) =x? +2x — y? and H = {(x, y): x7 +4y? < 4} 

b) f(x,y) = x* + 2xy + 3y?, and H is the region bounded by the 
triangle with vertices (1,0), (1,2), (3,0) 

c) f(x, y) =x? + 3xy — y’, and HW = [-1, 1] x [-1, 1] 


For each of the following, use Lagrange multipliers to find all extrema 
of f subject to the given constraints 


a) f(x,y) =x+y?andx?+y* =4 

b) f(x, y) =x? —4xy + 4y? and x74 y? =1 

c) f(x,y, z=xy, xr +y?+2=landxt+y+z=0 

d) f(x, y,z,w) =3x+y+u, 3x7 +y+4z3 = 1 and —x7+3z++w =0 


Suppose that f : R” > R” is differentiable at a, and that g: R” > R 
is differentiable at b = f(a). Prove that if g(b) is a local extremum of 
g, then V(g 0 f)(a) = 0. 
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11.7.5. Suppose that V is open in R’, that (a,b) € V, and that f : V > Rhas 
second-order total differential on V with f,(a,b) = f\(a,b) = 0. If 
the second-order partial derivatives of f are continuous at (a, b) and 
exactly two of the three numbers f,,(a,b), fxry(a, b), and fyy(a, b) are 
zero, prove that (a, b) is a saddle point if fry (a, b) £ 0. 

11.7.6. Suppose that V is an open set in R”, thata € V, andthat f:V —>R 
is C* on V. If f(a) is a local minimum of f, prove that D® f(a; h) > 0 
for allh € R”. 

11.7.7. Let a,b,c, D,E be real numbers with c # 0. 


a) If DE > 0, find all extrema of ax + by + cz subject to the constraint 
z = Dx? + Ey’. Prove that a maximum occurs when cD < 0 anda 
minimum when cD > 0. 

b) What can you say when DE < 0? 


11.7.8. [ImpLicir METHOD]. 


a) Suppose that f,g : R? — R are differentiable at a point (a, b,c), 
and that f(a, b,c) is an extremum of f subject to the constraint 
g(x, y, Zz) =k, where k is a constant. Prove that 


) 0 0 0 
Gb. c) 2 (a,b, c) =. DG c) 2 (a, b, c) =0 
dz dz Ox 


ax 
and 
0 0 0 0 
LE ICR ee CRs enh 
dy az Oz dy 


b) Use part a) to find all extrema of f(x, y, z) = 4xy+2xz+2yz subject 
to the constraint xyz = 16. 


11.7.9 |. This exercise is used in Section *14.4. 


a) Let p > 1. Find all extrema of f(x) = Ya subject to the 
constraint )-y_, [xx |? = 1. 
b) Prove that 


l n 1/p an 1/2 ¥ I/p 
2 
pvp) (S20 < (>) < pang 

k=1 k=1 k=1 


for all x1,...,%, € R, n € N,and1 < p <2. 


11.7.10. [LEAsT SQUARES OR LINEAR REGRESSION]. 
Suppose that points x and y are fixed in R” and set 


n n 2 
do =nyt- (Sox) F 
k= k=1 


462 Chapter 11 Differentiability on R” 


a) 


b 


wa 


Prove that if 
n 
F(a,b) = Do (ye — (axe +b) 
k=1 


for (a, b) € R’, then the system 


gE Pectin ras 
Ras a 


is solved by 

see Doka1 kV — oka Tk oka Yk 

; 7 
and 
pee pee xp opt Yk Det Nk eS Xk Yk 

; i ; 
Prove that if ag and bo are given by part a), then the straight line 
whose graph is closest to the points (x1, y1),..., (Wn, Yn) —that is, 
such that 


Se — (xg +B)? 


k=1 


is minimized —is the line A(x) = agx + bo. 


CHAPTER_=— 12 


Integration on R” 


12.1 


JORDAN REGIONS 


In this section we define grids and use them to identify special subsets of R”, 
called Jordan regions, which have a well-defined volume. In the next section, 
when we define integrals of multivariable functions on Jordan regions, grids will 
play the role that partitions did in the one-variable case. 

Throughout this chapter, R will represent a nondegenerate n-dimensional 
rectangle; that is, 


R= [ay, bj] x «++ x [dy, by] = {x eR’: 47S lay, bj) for f= 1,. 3457), (1) 


where a; < b; for j = 1,2,...,n. A grid on R is a collection of n-dimensional 
rectangles G = {R1,..., Rp} obtained by subdividing the sides of R; that is, for 
each j = 1,...,n there are integers v; € N and partitions P; = Pj|(G) = a’ : 
k =1,...,v;} of [a;, bj] such that G is the collection of rectangles of the form 
I,x...xIn, where each J; = beeen 5] for some k = 1,..., vj (see Figure 12.1). 
A grid G is said to be finer than a grid 1 if and only if each partition P;(G) is 
finer than the corresponding partition P;(H), j = 1,...,n. Notice that given 
two grids G and H, there is a grid Z which is finer than both G and . [Such a 
grid can be constructed by taking P;(Z) = Pj}(G) UP; (H) for j =1,...,n.] 

If R is an n-dimensional rectangle of the form (1), then the volume of R is 
defined to be 


|R| = (b — a1)... (bn — an). 


(When n = 1, we shall call |R| the length of R, and when n = 2, we shall call 
|R| the area of R.) Notice that given ¢ > 0 there exists a rectangle R* such that 
R C (R*)? and |R*| = |R| + €. Indeed, since b; — aj + 25 > bj; — a; asd > 0, 
we can choose 5 > 0 so small that R* := [a; — 6, bj + 6] x «++ x [an — 6, bn + 6] 
satisfies |R*| = |R| +. 

We want to define the integral of a multivariable function on a variety of 
sets; for example, the integral of a function of two variables on rectangles, disks, 
triangles, ellipses, and the integral of a function of three variables on balls, cones, 
ellipsoids, pyramids, and so on. One property these regions all have in common 
is that they have a well-defined “area” or “volume.” 

How shall we define the volume of a general set E? Let R be a rectangle 
which contains E. If E is simple enough, we should be able to get a good 
approximation for the volume of E by choosing a sufficiently fine grid G on 
R and adding up the collective volumes of all rectangles in G which intersect E. 
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a Xp XX X4 b x 
FIGURE 12.1 


Accordingly, we define the outer sums of E with respect to a grid G ona rectangle 
R by 


V(EsG):= > IRjl, 


Rj NED 


where the empty sum is by definition zero. Notice that since the empty sum is 
defined to be zero, V(@; G) = 0 for all grids G. 

Figure 12.2 illustrates an outer sum for a particular set E and grid G. The 
rectangles which intersect E have been shaded; those which cover 3 E are darker 
than those which are contained in E°. Notice, even for this crude grid, that the 
shaded region is a fair approximation to the “volume” of E. 


OE aie 


FIGURE 12.2 


Section 12.1 Jordan Regions 465 


The following result shows that as the grids get finer, the outer sum approxi- 
mations to the volume of E get better. 


12.1 Remark. Let R be an n-dimensional rectangle. 
i) Let E be a subset of R, and let G, H be grids on R. If G is finer than H, then 


V(E;G) < V(E; H). 
ii) If A and B are subsets of R and A © B, then 


V(A;G) < V(B;G). 


Proof. i) Since G is finer than 1H, each Q ¢€ 7 is a finite union of R;’s in G. 
If ONE # Y, then some of the R j 8 in Q intersect E and others might not 
(see Figure 12.3, where the darker lines represent the grid H, the lighter 


lines represent G \ 7H, and the Rj’s which intersect E are shaded). Let 
ZT ={R € G: ROE ¢< BandZ, = {RE G\T,: R © Q for some 
Q€Hwith ON E £G}. Then 


V(E;H) = Do IRI + D2 IRL> DO IRI = VEE; G). 


ReT, Rey ReT 


ii) If A C B, then A C B (see Exercise 9.2.3). Thus, every rectangle which 
appears in the sum V(A; G) also appears in the sum V(B; G). Since all |Rj|’s 
are nonnegative, it follows that V(A; G) < V(B; 9). | 


In view of this, we guess that the volume of a set E can be computed by 
taking the infimum of all outer sums of E. Unfortunately, this guess is wrong 
unless some restriction is made on the set E. To see why a restriction is neces- 
sary, notice that any reasonable definition of volume should satisfy the following 


FIGURE 12.3 
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property: If E = AU B, where B = E \ A, then the volume of EF equals the sum 
of the volumes of A and B. The following example shows that this property does 
not hold if A is too fractured. 


12.2 EXAMPLE. 


If R = [0,1] x [0,1], A = {(x, y) : x,y € QAO, 1}, and B = R \ A, then 
V(A; G) + V(B; G) = 2V(R; G) no matter how fine G is. 


Proof. Let G = {R,..., Rp} be a grid on R. Since each R; is nondegenerate, 
it is clear by the Density of Rationals (Theorem 1.18) that Rj; NA ¥ Y for all 
j € U1, p]. Hence V(A; G) = |R| = 1. Similarly, the Density of the Irrationals 
(Exercise 1.3.3) implies V(B; G) = |R| = 1. a 


The real problem with A is that its boundary, 9A := A \ A? = R, is too 
big. To avoid this type of pathology, we will restrict our attention to “Jordan” 
regions; that is to sets whose boundaries are small in the following sense (see 
Definition 12.5 and the darkly shaded rectangles in Figure 12.2). 


12.3 Definition. 


A subset E of R” is said to be of volume zero if and only if given e > 0 there is 
rectangle R D> E anda grid G = {Rj,..., Rp} on R such that V(E; G) <«. 


Recall that E is covered by {Ox}i_, means that E C Ws Ox. By Defini- 
tion 12.3, E is of volume zero if and only if it is covered by rectangles (from a 
grid G) whose total volume is as small as one wishes. The next result contains 
two other (highly useful) descriptions of sets of volume zero. 


12.4 Theorem. For every subset E of R", the following three conditions are 
equivalent. 


i) E is of volume zero. 
ii) There is an absolute constant C > 0 such that for each ¢ > 0 there is a 
rectangle R, which contains E, and a grid G on R such that 


V(E;G) < Ce. 


(The constant C > 0 may depend on E but does not depend on « or G.) 
iti) For every ¢ > 0 there is a finite collection of cubes Qx of the same size; that 
is, all with sides of length s, such that 


q q 
EC\)Q and Y"|Ol<e. 


k=1 k=1 


In particular, if E is a set of volume zero and A C E, then both A and 0A are sets 
of volume zero. 
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Proof. i) implies ii). If VCE; G) < e, then V(E; G) < Ce for C = 1. 

ii) implies iii). Let e > 0 and set n = €/(2C). By hypothesis, there exists a 
grid G such that if {R),..., Rp} represents all rectangles in G which intersect 
E, then 


P P 
Ec\(JR; and S°|Rj| < Cn. 

j=l j=l 
By increasing the size of the R;’s slightly, we may suppose that the sides of 
each R; have rational lengths, and poe |Rj| < 2Cn = «. (These rectangles 
no longer form a grid because they may have some overlap, but they still cover 

E and hence E itself.) 

The lengths of the sides of the R;’s have a common denominator, say d. 
By using a grid fine enough, we can divide each R; into cubes oe ) for 
k = 1,2,...,vj; and some choice of v; € N, such that each om has sides 


of common length s = 1/d. Since |Rj| = eae 1 lo )), it follows that 


PY P 
DV Ve? = Vay <e. 
j=l 


j=lk=l1 
iii) implies i). Suppose that E can be covered by finitely many cubes 
Ox = fay, By] x ++ x fal, bY] 


whose volumes sum to a number less than ¢. Let R be a rectangle which 
contains the union of the Q;’s. For each j = 1,2,...,n, the endpoints 
{a ; ae soph, a”, po m can be arranged in increasing order to form a parti- 
tion of the jth side of R. Thus there is a grid G = {R1,..., Rp} so fine that each 
QO; is a union of the R;’s (see Figure 12.4). Thus V(E;G) < pa |Oxz| < 6&3 
that is, E is a set of volume zero by definition. This completes the proof that 
conditions 1), 11), and iii) are equivalent. 

Finally, suppose that A C E and G is a grid on some rectangle that contains 
E. Since 0A C A C E, it is clear that V(0A, G) < V(A,G) < V(E, G). Thus if 
E isa set of volume zero, then so are A and 0A. B 


We are now prepared to define volume. 


12.5 Definition. 


A subset E of R” is called a Jordan region if and only if E C R for some 
n-dimensional rectangle R and 0E is of volume zero, in which case we define 
the volume (or Jordan content) of E by 


Vol(E) := int V(E; G) := inf{V(E; G) : G ranges over all grids on R}. 
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FIGURE 12.4 


We sometimes shall call Vol(E) length when n = 1 and area when n = 2. 
Notice, then, that the empty set is of length, area, and volume zero. 

By Theorem 12.4, it is clear that a set A is of volume zero if and only if 
Vol(A) = 0. Since rectangles are bounded, it is also clear by Definition 12.5 
that every Jordan region is bounded. The converse of this last statement is false. 
The set A in Example 12.2 is bounded but not a Jordan region. 

Notice that by Theorem 12.4 and Definition 12.5, a set E is a Jordan region if 
and only if its boundary can be covered by cubes whose total volume is as small 
as one wishes. We shall use this observation many times in the sequel. 

Before we continue, we need to show that Vol(E) does not depend on the 
rectangle R chosen to generate the grids G. To this end, let R and @Q be rectan- 
gles which contain E. Since the intersection of two rectangles is a rectangle, we 
may suppose that E C QC R. Since Q C R, it is easy to see that 


inf V(E;H) < inf V(E;Q). 
Hon Q GonR 


On the other hand, given ¢ > 0, choose, for each grid H on Q, a rectangle Q* 
such that Q Cc (Q*)° and |Q*| = |Q| + e. Let Ho be the grid formed by adding 
the endpoints of Q* and R to 7; that is, if 


O* = [a, Bi] x --- x [an, Bn] and R = [y1, 51] x «++ x [yn, Sn], 


then P;(Ho) = Pj(H) U {a;, Bj, yj, 6;}. Then Go := HoN R is a grid on R whose 
rectangles which intersect E are either part of 1 to begin with or the thin ones 
formed by adding the endpoints of Q*. Hence, 


V(E; H) + € = V(E; Go) = au V(E; G). 
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It follows that 
inf V(E;H) < inf V(E;G) < inf V(E;H) +e, 
Hon QO GonR Hon @Q 


for every ¢ > 0. By letting « — 0, we verify that the definition of volume does 
not depend on the rectangle R. 

Next, we show that the two definitions of the volume of a rectangle (length 
times width times etc. versus the infimum of outer sums) agree. 


12.6 Remark. Jf R is an n-dimensional rectangle, then R is a Jordan region in 
R’ and 


Vol(R) = |R|. 
Proof. Let « > 0 and suppose that 


R = [a1, by] me Ht lan, bn). 


Since bj — aj — 26 — b; — aj; as 5 — 0, we can choose 5 > 0 so small that if 
Q = [a1 + 46, b, — 6] x +--+ X [an +8, bn — 8), 


then |R| — |Q| <e. 
Let G := {Hj,..., Hj} be the grid on R determined by 


Pj (G) = {aj,aj +,b; — 5, dj}. 


Then it is clear that an H; € G intersects dR if and only if H i # Q. Hence, 


VOR;G):= > IH =|RI-IOl <e. 


HjNORFO 


This proves that R is a Jordan region. 

To compute the volume of R using Definition 12.5, let G = {R1,..., Rp} be 
any grid on R. Since R; 1 R # W for all R; € G, it follows from definition that 
V(R; G) = |R|. Taking the infimum of this identity over all grids G on R, we 
find that Vol(R) = |R|. a 


In general, it is not easy to decide whether or not a given set is a Jordan 
region. Topology alone cannot resolve this problem since there are open sets in 
R" which are not Jordan regions (see Spivak [12], p. 56). In practice, however, 
it is usually easy to show that a specific set E is a Jordan region by applying 
Theorem 12.4 to dE. Here is a typical example. 


12.7 Theorem. Jf FE, and Ez are Jordan regions, then E, U E2 is a Jordan 
region and 


Vol(E, U Ex) < Vol(E}) + Vol(E2). 
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Proof. We begin by proving that E) U £2 is a Jordan region. Since E; and E2 
are Jordan regions, use Theorem 12.4 to choose cubes {S;} which cover 0 EF 
(respectively, cubes {Q;} which cover 0 £2) such that 


P P 

é E 
> ISjl <5 and SI Oil < 7 
j=l k=1 


But by Theorem 9.19 or 10.31, 0(E; U Ez) € dE, UOEd. Thus {S;} U {Q;x} isa 
collection of cubes which covers 0(£; U E2) whose volumes sum to a number 
less than e. Hence by Theorem 12.4 and Definition 12.5, FE; U E2 is a Jordan 
region. 

To estimate the volume of E,; U Ep, let G be a grid on a rectangle which 
contains E) U E). If R; intersects E, U £2, then by Theorem 9.19 (or 10.31) R; 
intersects E, or E> (or both). Hence, V(E, U E2; G) < V(E1; G) + V(E2; G). 
Taking the infimum of this inequality over all grids G, we obtain 


Vol(E| U E2) < Vol(E}) + Vol(E2). a 


By iterating this result, we see that the collection of Jordan regions is closed 
under finite unions. This is also the case for intersections and set differences (see 
Exercise 12.1.6b). 

Theorem 12.4 can also be used to show that spheres, ellipsoids, and, in fact, 
all “projectable regions” (just about anything you can draw) are Jordan regions 
(see Exercise 12.1.4 and Theorem 12.39). 

To evaluate integrals of multivariable functions over unions of sets, we intro- 
duce the following concept. 


12.8 Definition. 


Let € := {E¢}een be a collection of subsets of R”. 


i) € is said to be nonoverlapping if and only if E; N Ex is of volume zero for 
JFK. 
ii) € is said to be pairwise disjoint if and only if E; N Ex = @ for j #k. 


Notice that since % is of volume zero, every collection of pairwise disjoint sets 
is nonoverlapping. (The converse of this statement is false —see Exercise 12.1.3 
below.) 

In order to prove a change-of-variables formula in R” in Section 12.4, we need 
to identify conditions under which a C! function preserves Jordan regions (see 
Theorem 12.10). Since Jordan regions are sets whose boundaries are of volume 
zero, we first prove a result about functions which take sets of volume zero to 
sets of volume zero. 


12.9 Lemma. 
Suppose that V is a bounded, open set in R" and that @ : V + R" is I-I and (os 
on V with Ag # 0. If E is of volume zero and E C V, then $(E) is of volume 
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zero. In particular, if {Ex}ken is a nonoverlapping collection of sets in R" with 
Ex C V for allk €N, then {6(Ex)}cen is a nonoverlapping collection of sets 
in R". 


Proof. Since E Cc V, for each x € E there is an r(x) > O such that 
By) (x) C V. Since 


EC|J Bw, 
xcE 
it follows from the Heine—Borel Theorem that there exist finitely many x, € E 
such that the bounded open set 


N 
U := J Brgy x) 
j=l 


contains E. Set H := U. Since U is bounded, H is compact. Moreover, the 
construction guarantees that E C H°® C H Cc V. Thus by Corollary 11.34, 
there exists an M > 0 which depends only on H, @, and n such that 


lp(x) — o(y) || < Milx—yll, x ye, (2) 
for all cubes O C H. 
Let « > O and set C := (2A)", where A := M./n. Since E is of volume zero 


and E C H°, use Theorem 12.4 to choose cubes Q),..., Qz, all the same size 
with sides of length s, such that Q; C H, 


q q 
EcUQ;, and rial <=. (3) 
j=l j=l 
Fix j and fix x9 € Q;. By (2) and Remark 8.7, 
IIP(Xo) — O(x)|| < Ms /n = 5A 
for allx € Q;. Thus $(Q;) is contained in the cube oF = I, x---x In, where 
Tk := [bk (Xo) — SA, Ox (X0) + 5A]. 
In particular, it follows from the left side of (3) and Theorem 1.37 that 


q q q 
o(E)c | (J 9;) =U ¢@) U9. 
j=l j=l 


j=l 
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But the sides of 0; are of length 2s A, so |0;| = (2sA)" = Cs”. In particular, 
the right side of (3) implies that 


q q q 
YG; = dies" =C >“ 19)| <C5=6. 


j=l j=l j=l 


Therefore, Vol(@(E)) = 0 by Theorem 12.4. 
Finally, by what we just proved, if E, Ej; is of volume zero, then so is 
@(E; M E;). But by Exercise 1.5.7 (since ¢ is 1-1), 


b(Ex) 1 O( Ej) = O(EK Ej). 
Thus {@(£;)} is nonoverlapping when {£;} is nonoverlapping. a 


12.10 Theorem. Suppose that V is a bounded, open set in R" and that @ : 
V > R’is 1-1 andC! on V with Ag # 0. If E is a Jordan region and E C V, 
then (E) is a Jordan region. 


Proof. By Definition 12.5, Theorem 12.4, and Lemma 12.9, it suffices to prove 
that d(@(E)) C P(E). By Theorem 11.39, the set @(E°) is open, and by 
Theorem 9.54 (or 10.61), the set @(E) is closed. It follows from Theorem 9.14 
(or 10.25) that @(E°) € (@(E))° and @(E) D> ¢(E). Therefore, 


d(P(E)) = $(E) \ (P(E)? C OE) \ (E*) = OE \ E°) = OE). a 


We close this section with some optional results which will not be used else- 
where. They show that the volume of a set can also be approximated from below 
using inner sums. 


We introduced outer sums (analogues of upper sums) and defined the volume 
of a Jordan region as the infimum of all outer sums. In order to calculate the vol- 
ume of a specific set, it is sometimes convenient to have inner sums (analogues 
of the lower sums we used to define integrals in Chapter 5). Given E Cc R",a 
subset of some n-dimensional rectangle R, and G = {R; : j = 1,..., p}, a grid 
on R, the inner sums of E with respect to G are defined by 


EG) = >> Ril, 


R;CE° 


where the empty sum is again interpreted to be zero. Thus v(£; G) = 0 for all 
grids G and allsets E satisfying E° = @. 

Inner and outer sums can be used to define inner and outer volume of ANY 
bounded set, in the same way that upper and lower sums were used to define 
upper and lower integrals of any bounded function (see Definition 12.13 below). 
If G is fine enough and E is Jordan, the inner sum of a Jordan region E with 
respect to G should approximate Vol(£); just as V(E; G) overestimated Vol(E£), 
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each v(E; G) underestimates Vol(£). [In Figure 12.2, the underestimate v(E; G) 

is represented by the lightly shaded rectangles. You might refine the grid there 

and revisualize the inner and outer sums to illustrate that these estimates get 
better as the grid gets finer. ] 

Since v(E; 7) is either zero or a sum of nonnegative terms, it is clear that 


v(E; H) > 0 for all grids H. If we combine this observation with the proof of 
Remark 12.11, we can also establish the following result. 


12.11 Remark. Let R be an n-dimensional rectangle, let E be a subset of R, and 
let G, H be grids on R. If G is finer than H, then 


0 < (E; H) < v(E; G) < V(E;G) < V(E; HH). 
This leads us to the following fundamental principle. 


12.12 Remark. Let R be an n-dimensional rectangle and E be a subset of R. If G 
and Hare grids on R, then 


0 < v(E;G) < V(E; H). 
Proof. Let Z be a grid finer than both G and H. By Remark 12.11, 
0 < v(E;G) < (E:T) < V(E;T) < V(E; H). a 


Using the sums v(E; G) and V(E; G), we can define inner and outer volume 
of any bounded set E. 


12.13 Definition. 


Let E be a bounded subset of R” and let R be an n-dimensional rectangle 
which satisfies E C R. The inner volume of E is defined by 


Vol(E) := sup{v(E; G) : G ranges over all grids on R}, 
and the outer volume of E is defined by 


Vol(E) := inf{V(E; G) : G ranges over all grids on R}. 


As before, we can show that this definition is independent of the rectangle R 
chosen to generate the grids G. 

When E is a Jordan region, the outer and inner volume of E is precisely the 
volume of E. 


12.14 Theorem. Let E be a bounded subset of R". Then E is a Jordan region if 
and only if Vol(E) = Vol(£). 
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Proof. Let E Cc R" and suppose that R is a rectangle which contains E. We 
shall show that for all grids G on R, 


V(E; G) — v(E; G) = VE; G). (4) 


If E° = 9, then JE = E and (4) is obvious. Otherwise, suppose that R; € G is 
a rectangle which appears in the sum represented by the left side of (4); that 
is, R; intersects E but R; is not a subset of E°. If R; does not appear in the 
sum represented by the right side of (4), then R; N dE = G. It follows that 
the pair E°, (R"\E)° separates R;, a contradiction since all rectangles are 
connected (see Remark 9.59). Therefore, every rectangle which appears in 
the sum represented by the left side of (4) also appears in the sum represented 
by the right side; that is, 


V(E; G) — v(E; G) < V(OE;G). 


On the other hand, suppose that R; € G is a rectangle which appears in the 
sum represented by the right side of (4); that is, Rj; NdE #4 YJ. Recall from 
Theorems 9.6 and 9.18 (or 10.30 and 10.22) that dE = E \ E°® is closed, so 
R; dE # W. It follows that Rj intersects E but R; is not a subset of E°. 
Thus every rectangle which appears in the sum represented by the right side 
of (4) also appears in the sum represented by the left side; that is, V(E; G) — 
v(E; G) = V(dE; G). This proves (4). 

To prove the theorem, suppose that E is a Jordan region. By definition, 
Vol(@E) = 0. Since by (4), V(GE; G) = V(E; G) — v(E; G) = Vol(E) — Vol(E), 
it follows (by taking the infimum of this last inequality over all grids G) that 


0 = Vol(QE) > Vol(E) — Vol(E) > 0. (5) 
Thus Vol(E) = Vol(E). 
Conversely, suppose that Vol(E) = Vol(E). By the Approximation Prop- 
erty, given ¢ > 0, there exist grids 7{; and 2 such that 


Vol(E) +e > V(E;H\) and Vol(E) —« < v(E; H2). 


If G is a grid on R which is finer than both 7; and Ho, it follows from 
Remark 12.11 that 


Vol(E) +e > V(E;G) and Vol(E)—« < v(E;G). 
Subtracting these inequalities, we see by (4) that 
0 < V(QE; G) = V(E; G) — v(E; G) < Vol(E) — Vol(E) + 2¢ = 2e. 


Hence E is a Jordan region by definition. a 
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EXERCISES 
12.1.1. a) Form = 1, 2, 3, let G,, be the grid on [0, 1] x [0, 1] generated by 
Fi OGn) Sth 2 eS 0 Asso 52) 


where j = 1, 2. For each of the following sets, compute V(E; Gi). 
a) E = {(x, y) € [0,1] x [0, 1]: x =Oor y = O}. 
B) E ={(, y) € [0, 1] x [0, 1]: y < x}. 
vy) E={(x, y) €[0, 1] x (0, 1]: Qx — D2 + Qy — 1)? < 1}. 
b) For each £ in part a), compute v(E; G,,). 


12.1.2. a) Prove that every finite subset of R” is a Jordan region of vol- 
ume zero. 


b) Show that, even in R’, part a) is not true if finite is replaced by 
countable. 


c) By an interval in R* we mean a set of the form 
{(x,c):a<x<b} or {(c,y):a<y<b} 


for some a,b,c € R. Prove that every interval in R2 is a Jordan 
region. 
12.1.3. Prove that every grid is a nonoverlapping collection of Jordan regions. 


12.1.4. a) Prove that the boundary of an open ball B,(a) is given by 
dB,(a) = {x: |x — all =r}. 


b) Prove that B,(a) is a Jordan region for alla € R” and allr > 0. 


12.1.5. Let E be a Jordan region in R”. 


a) Prove that E° and E are Jordan regions. 

b) | This exercise is used in Section 12.2. Prove that Vol(E°) = Vol(E) = 
Vol(E). 

c) Prove that Vol(Z) > Oif and only if E° 4 @. 

d) Let f : [a,b] — R be continuous on [a, b]. Prove that the graph of 
y = f(x), x € [a, b], is a Jordan region in R?. 

e) Does part d) hold if continuous is replaced by integrable? How 

about bounded? 


12.1.6 |. This exercise is used in Section *12.5. Suppose that E;, E2 are Jordan 


regions in R”. 
a) Prove that if FE; C Eo, then 
Vol(E1) < Vol(E2). 


b) Prove that E; M E2 and EF \ E> are Jordan regions. 
c) Prove that if Ej, E2 are nonoverlapping, then 


Vol(E, U Ex) = Vol(E,) + Vol(E2). 
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d) If Eo C E), prove that 
Vol(E; \ E2) = Vol(E) — Vol(E2). 
e) Prove that 


Vol(E U Ex) = Vol(E}) + Vol(E2) — Vol(E, M £2). 


12.1.7 |. This exercise is used in Section *12.6. Let E Cc R”. The translation of 
E by an x € R” is the setx + E = {y € R”: y=x+z forsomezeé F}, 
and the dilation of E by a scalar a > Ois the setaE = fy € R”: y= 
az for some z € E}. 


a) Prove that E is a Jordan region if and only if x+ £ is a Jordan region, 
in which case Vol(x + £) = Vol(£). 
b) Prove that E is a Jordan region if and only if w£ is a Jordan region, 
in which case Vol(wE) = a” Vol(E£). 
12.1.8. A set E C R” is said to be of measure zero if and only if given e > 0 
there is a sequence of rectangles R,, R2,... which covers E such that 
yi lRel Se: 


a) Prove that if E C R” is of volume zero, then E is of measure zero. 
b) Prove that if E C R” is at most countable, then E is of measure zero. 
c) Prove that there is a set E C R* of measure zero which does not 
have zero area and, in fact, is not even a Jordan region. 
*12.1.9. Show that if £ C R” is bounded and has only finitely many cluster 
points, then E is a Jordan region. 


12.2 RIEMANN INTEGRATION ON JORDAN REGIONS 


By analogy with the one-variable case, the integral of a nonnegative function 
f over a Jordan region E should be the volume of the set {(x, tf) : x € E,0 < 
t < f(x)}. We should be able to approximate this volume by using (n + 1)- 
dimensional rectangles whose heights approximate t = f(x) and whose bases 
belong to some grid on E (see Figure 12.5). This leads us to the following defi- 
nition (compare with Definition 5.13). 


12.15 Definition. 


Let E be a Jordan region in R”, let f : E — R be a bounded function, let R 
be an n-dimensional rectangle such that E C R, and let G = {Rj,..., Rp} bea 
grid on R. Extend f to R” by setting f(x) = 0 forx eR” \ E. 


i) The upper sum of f on E with respect to G is 


UFG):= D> MiIRjl, 


Rj NEAD 


where Mj = supy. Rr; S&). 
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ii) The lower sum of f on E with respect to G is 


LG) := DS) mylRyl, 


RjNEFO 


where mj; = infxe Rr, f(). 
iii) The upper and lower integrals of f on E are defined by 


w | f(x) dx:= w | f dV := sup L(f, 9) 
E E G 


and 
w | Cee w | fav :=infU(f, 9), 
E E G 


where the supremum and infimum are taken over all grids G on R. 


FIGURE 12.5 


Using the fact that f(x) = 0 when x € R \ E, we can modify the proofs of 
Remarks 5.7, 5.8, and 5.14 to establish the following result. 


12.16 Remark. Let E be a nonempty Jordan region in R", let f : E > R be 
bounded, and let R be a rectangle which contains E. 


i) IfG and H are grids on R, then 


L(f,9) s U(F, A). 


478 Chapter 12 Integration on R” 


ii) The upper and lower integrals of f over E always exist, do not depend on the 
choice of R, and satisfy 


wf favew | sav. (6) 


12.17 Definition. 


A real-valued bounded function f defined on a Jordan region E is said to be 
(Riemann) integrable on E if and only if for every « > 0 there is a grid G 
such that 


U(f,G)-LUF,G) <e. 


By modifying the proof of Theorem 5.15, we can establish the following result. 


12.18 Remark. Let E be a Jordan region in R" and suppose that f : E > Ris 
bounded. Then f is integrable on E if and only if 


wf rav=w | sav. (7) 


When f is integrable on FE, we denote the common value in (7) by 


[ feos or [trav 


and call it the integral of f over E. For n = 2 (respectively, n = 3) we shall fre- 
quently denote the integral /;, f dV by J, f dA (respectively, by [ff f dV). 

The following result shows that evaluation of Riemann integrals over Jordan 
regions reduces to evaluation of Riemann integrals over rectangles. 


12.19 Remark. Let E be a Jordan region in R", let R be an n-dimensional rect- 
angle which contains E, and suppose that f : E — Ris integrable on E. If 


_ Jf) xeE 
at ee 


then g is integrable on R and 


[rav= f eav. (8) 


Proof. By Definition 12.15, the upper and lower sums of f and g are iden- 
tical; hence, they have the same upper and lower integrals. It follows from 
Remark 12.18 that they have the same integrals. a 
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This last proof worked because we defined the upper and lower integrals of 
a function f on E by extending f to be zero off E. We did this to be sure that 
U(f; G) was an overestimate of the integral of f and L(f; G) was an underesti- 
mate. Unfortunately, the abrupt change from f to 0 at the boundary of E intro- 
duces additional complications. The next result shows that since the boundary 
of E is of volume zero, we can ignore what happens at the boundary. 


12.20 Theorem. Let E be a Jordan region and suppose that f : E > Ris 
bounded. Then given € > 0 there is a grid Go such that if G := {R1,..., Rp} is any 
grid finer than Go and M;, mj are defined as in Definition 12.15, then 


W) | Foo dx- S> Mj IRiIl| <e 


R; CE? 


and 


(L) | fo ax > mj (Ril | <e. 


R;CE° 


Proof. Let « > 0 and choose M > 0 such that | f(x)| < M for allx ¢€ E. 
Since Vol(0E) = 0, we can choose a grid 7; such that V(0E; 1) < e/(2M),. 
Moreover, by the Approximation Property of Infima, we can choose a grid 
‘H> such that 


w) | f (x) dx < U(f; Fn) < wf f (x) dx + a 
E E 2 


Let Go be a grid finer than both 7; and #2, and suppose that G = 
{R,,..., Rp} is finer than Go. Since each R; is connected, it is easy to see that 
if Rj intersects E but R; is not a subset of E°, then R; intersects 0E. [Indeed, 
if Rj, dE = 9, then the pair E°, (R"\E)° separates R ja contradiction since 
all rectangles are connected—see Remark 9.59.] Since G is finer than 7; and 
Hy, it follows that 


é 
W) | fo ax- dD MIR) <5 +|UGiD— Dy Mj Ii 

a RjCE° RjCE® 
&é 

<5t+ Dy IMIR) 

RjNIEAD 

é 

<=++M V(0E;G) <e. 


2 


A similar proof establishes the inequality involving lower sums and lower 
integrals. a 


It follows that if E° = %, then the upper and lower integrals of any bounded 
f are zero; that is, [, f dV = 0. 
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Can we avoid worrying about the boundary by redefining the numbers M; and 
mj in Definition 12.15? For example, why not just define Mj = supyerjng f(x)? 
This approach will not work because the infimum of these upper sums will not 
equal the integral of f. For example, suppose that Go = {[0, 1] x [0, 1]} and 
that G = {R1, Ro, R3, R4}, where the R;’s are formed by bisecting the sides 
of Go; that is, each R; is exactly one-fourth of the unit square. Let E = R and 
suppose that f = 1 on R?, but f = —1 otherwise. If M; is defined as above, 
then U(f, Go) = 1 but U(f,G) = —1/2, which is LESS than J, f dV = 1/4. 
Evidently, in order to define f on E by looking at grids on a rectangle which 
contains E, we must extend f to be zero off E. 

Our first application of Theorem 12.20 is an analogue of Theorem 5.10. 


12.21 Theorem. [f E is a closed Jordan region in R" and f : E — Ris continu- 
ous on E, then f is integrable on E. 


Proof. Since by hypothesis E is closed and bounded, f is bounded on E 
(apply the Extreme Value Theorem and the Heine—Borel Theorem). To show 
that f is integrable on E, let e > 0 and R be a rectangle which contains E. By 
Theorem 12.20, there is a grid Go on R such that if G = {Rj,..., Rp} is any 
grid which is finer than Go, then 


wf rav-w | rav— > (Mj — mj) |Rjl | <e. (9) 


Rj CE? 
Since f is uniformly continuous on E, choose 6 > 0 such that 
IIx—yl|<6 and x,yeE& imply |f(x)— fiy)l <e. 


Make G finer by insisting that for each Rj € G, ||x — yl] < 6 whenx,y € Rj. 
Then the choice of 5 implies that Mj — mj; < « for all j which satisfy Rj C E. 
Hence it follows from Remark 12.16 and (9) that 


o<wf fav-a) | fav <e+ > (Mj = mj)\RjI 
E E 


Rj CE? 
<e+eV(E;G) < «(14+ |R)). 


Since ¢ > 0 was arbitrary, we conclude that (U) J, f dV = (L) J, f dV (ie., f 
is integrable on E£). a 


The proof of Theorem 12.21 shows that the hypothesis that E be closed can 
be weakened if we insist that f be uniformly continuous on E£. (See also Exer- 
cises 9.7.8 and 12.2.11.) 

The following result shows that the volume of a Jordan region can be com- 
puted by integration. 
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12.22 Theorem. [f E is a Jordan region, then 
Vol(E) = / 1 dx. 
E 


Proof. By Exercise 12.1.5b, we may suppose that E is closed. Let R be a 
rectangle containing E and G = {Rj,..., Rp} be agrid on R. Define f(x) = 1 
for x € E and f(x) = 0 for x ¢ E, and notice by Theorem 12.21 that f is 
integrable on E. Since Rj N E # Yimplies Rj N E # VG, and M;(f) = 1 when 
Rj OE # Y, it is clear, by the definition of upper sums and outer sums, that 
U(f,G) < V(E; Gg). Taking the infimum of this inequality over all grids G, and 
applying Theorem 12.21 together with Definitions 12.15 and 12.5, we have 


/ 1 dx = inf U(f,Q) < inf V(E; G) = Vol(E). 
E G G 


On the other hand, since Vol(dE) = 0, given e > 0 we can choose G so that 
V(OE;G) < e. Since mj(f) = 0 when Rj 9 ES # GU, and m;(f) = 1 when 
R; C E, it follows that 


[axe Leg = mir; 
E 


Rj NEAD 
= Rie YS R= > Rl 
RjCE R|NESD Ri NEA 


II 


V(E; G) — V(0E; G) = Vol(E) — «. 
Since e > 0 was arbitrary, it follows that f pl dx = Vol(E). | 


As in the one-dimensional case, the integral of a sum of functions over a union 
of regions can be broken into simpler pieces. 


12.23 Theorem. [LINEAR PROPERTIES]. 
Let E be a Jordan region in R", let f, g : E > R, and let a bea scalar. 


i) If f, g are integrable on E, then so are af and f + g. In fact, 


[erava=af rav (10) 


and 


[+ma=f rave [ eav. (11) 


ii) If Ej, Ey © E are nonoverlapping Jordan regions and f is integrable on 
both E\ and E, then f is integrable on E, U E2 and 


i fav =f fav+] fav. (12) 
E\UE2 E\ E2 
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Proof. We suppose for simplicity that a > 0. Let e > 0 and choose a grid G 
such that 


UFG-e< | fav <LE@ +e. (13) 
E 


Notice that U(af,G) = aU(f, G) and Liaf,G) = aL (f,G). Multiplying (13) 
by a, we obtain 


U(af, G) — ae < a | f dv < L(af,G) +a. 
E 

In particular, 

inf U(of, G) < «| fdV +ae 

E 

and 

sup L(af, G) > a | f dV —ae. 

G E 
Taking the limit of these inequalities as e — 0, we conclude that 


inf U(af, G) < af f dv <supL(af, 9). 
G E G 


This proves (10). 
To prove (11), choose a grid G such that 


uf.g-e< | fav el G G+e 
E 


and 
U(e.9)-e< | gdV <L(g,G)+6. 
E 


Adding these inequalities, we have 
U(F,9)+UG9)-2% < f fav+ f gaV <LF,G)+L,9) 42 
E E 


By definition, U(f + 8,9) < U(f,G) + U(g,G) and L(f + 8,9) = L(f,G) + 
L(g, G). Therefore, 


uif+eg—w<f pave f gav < Ls +99) 426 
E E 
that is, 
infu +8.9) = | fave | gaVv <supli(f+e.9). 
G E E G 


This proves (11). 
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To prove (12), let ¢ > 0 and apply Theorem 12.20 three times to choose a 
grid Go so that if G = {Ri,..., Rp} is finer than Go, then 


i! fdvV— S> Mj\Rj\| <e (14) 
vi RjCEP 
fori = 1,2, and 
(U) fdv—  S°  MjI\Rj\| <e. (15) 
E\UE2 Rj C(E\UE2)° 


Since E; and E2 are nonoverlapping, we may also assume that 
V(E| Eo; G) <e. (16) 


Let M = max{|Mj|,...,|Mp|}. Since each R; is connected and E7N ES = 9, 
it is easy to see that each R; C (E; U E2)?° satisfies one and only one of the 
following three conditions: i) R; C E?; ii) Rj C E5; or til) Rj NE, OE #9. 
Hence, it follows from (15), (16), and (14) that 


(U) fdv<e+ > MIR; 
E, UE, RjC(E\UE2)? 
<e+ > M|Rj|+ D> Mj|Rj|+M VEN E2;G) 
RCE; Rj CES 


<3e+ [ fav+| fdV+Me. 
E\ Ey 
Since ¢ > 0 was arbitrary, we obtain 


(U) fav [ fdV+] fdv. 
E, UE) Ey Ey 


A similar argument establishes 
wf save] pavef sav. 
E| UE) Ey, Ey 


Thus (12) holds. a 


The following result shows that the value of an integral remains the same 
when the integrand is changed on a set of volume zero (compare with Exer- 
cise 5.1.6). 
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12.24 Theorem. Let E be a Jordan region in R" and let f,g : E —> R be 
bounded functions. 


i) If Eo is of volume zero, then g is integrable on Eo and 


i. gdV =0. 
Eo 


ii) If f is integrable on E, if Eo is a subset of E of volume zero, and if g(x) = 
f() for allx € E \ Eo, then g is integrable on E and 


[eav=f rav. 


Proof. i) If Ej # %, then Epo contains a ball, hence a nondegenerate rect- 
angle, so Vol(E) > 0, a contradiction. Since Ej = 9G, it follows from Theo- 
rem 12.20 that 


w) | gdV =(L) gdV =0. 
Eo Eo 


ii) Since f = g on E \ Eo, it follows from the proof of Theorem 12.23ii and 
part i) above that 


[ fav = fdvV+] fav 
E E\Eo Eo 


=u) | edv +) f gdV =u) | gdV. 
E\Eo Eo E 


Similarly, f,, fdV < (L) f,gdvV. a 


This suggests a way to define the integral of f on a Jordan region E when f is 
not defined on all of £. Indeed, if f is defined on EF \ Eo, where Eo is of volume 
zero, and if the function 


— F(x) xE€E\ Eo 
at xe Eo 


is integrable on E, then define 


[fav ia got 
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Boia 2 
[> dx= f+ dax=4. 
0 x—l 0 


Henceforth, the phrase “f : E — Ris integrable” includes the possibility that 
f may not be defined on a subset of E of volume zero. 

The following result is a multidimensional analogue of Theorems 5.21 
and 5.22. 


For example, 


12.25 Theorem. [COMPARISON THEOREM FOR MULTIPLE 
INTEGRALS]. 

Let E be a Jordan region in R" and suppose that f, g : E — Rare integrable 
on E. 


i) If f(x) < g(x) forx € E, then 
/ fdaVv«< / gdVv. 
E E 
ii) Zfm, M are scalars which satisfy m < f(x) < M forx € E, then 
m Vol(E) < | f dV <MVol(E). 
E 


iii) The function | f | is integrable on E and 


[[rav 


Proof. i) If f < g on E, then L(f,G) < L(g, GQ) for any grid G. Taking the 
supremum of this inequality over all grids G verifies part i). 
ii) By Theorem 12.22, (10), and part i), 


=a Ifldv. (17) 
E 


mvoi(E) = f max = | save | M dx = M Vol(E). 
E E E 


iii) Let e > 0 and choose by Definition 12.17 a grid G = {R,..., Rp} such 
that 
U(f,G) - LUG) <e. (18) 
By repeating the argument which verified (10) in Theorem 5.22, we have 


sup | f()| — inf |f00| < sup fo) — inf f(x). 


xeR; xeR; 
Hence, it follows from (18) that 


U(If|,G) —LUFI,G) < UCf,G) — LIF,G) <e. 
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Thus | f| is integrable on E. Since —|f| < f < |f|, we conclude by part i) that 


-firiavs | rave f isiav. a 


12.26 Theorem. [MEAN VALUE THEOREM FOR MULTIPLE 
INTEGRALS]. 

Let E be a Jordan region in R” and let f,g : E — R be integrable on E with 
g(x) > Oforallx € E. 


i) There is anumber c satisfying 


inf f(x) <c < sup f(x) (19) 
xeE xeE 


such that 


c[sav=| feav. (20) 


ii) There is anumber c satisfying (19) such that 
c Vol(E) =), f dv. 
E 


Proof. i) By hypothesis, the product fg is integrable on E (see Exer- 
cise 12.2.8). Let m = infxer f(x) and M = supyer f(x). Since g > 0, The- 
orem 12.25 implies that 


mJ save] feavem| gav. (21) 
E E E 


If f,gdV = 0, then f, f(x)gdV = 0 by (21), so (20) holds for any c. If 
Jn gdV £0, then (20) holds for 


oa defeav 
tng dV 


ii) Apply part i) to g(x) = 1. a 


We close this section with some optional material which generalizes a concept 
introduced in Section 9.8. 
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*12.27 Definition. 


A set E C R’" is said to be of measure zero if and only if for every ¢ > 0 there 
is a countable collection of rectangles {Rj} jen such that 


CO 
2a oll ON ce and DIRS S 
j=l 


*12.28 Remark. /f F,, E2,... is a sequence of subsets of R" and each Ex is of 
measure zero, then 


is also of measure zero. 


Proof. Let ¢ > 0. For each k € N, choose a collection of rectangles {RM} jeN 
which covers FE; such that 


= € 
(k) 
Da ae 
j=l 
Clearly, the collection {RO} j,keN 18 countable, covers E, and 


oO wo k Co € 
Ds gee 


k=1 j=l k=1 
Consequently, E is of measure zero. | 


Every singleton E = {a} in R” is of measure zero. In fact, by comparing 
Definition 12.27 with Theorem 12.4, it is clear that every set of volume zero is 
a set of measure zero. The converse of this statement is false. Indeed, for each 
a € R the set {(a, y) : y € [0, 1]} is of volume zero and, hence, is of measure 
zero. Thus, by Remark 12.28, E := Q x [0, 1] is a set of measure zero. On the 
other hand, it is clear that Vol(E) = 0 < 1 < Vol(E), so E is not a set of volume 
zero; in fact, E is not even a Jordan region. 

An analogue of Lebesgue’s Theorem holds for multiple integrals. 


*12.29 Theorem. Let E be a Jordan region and let f : E — R be bounded. 


i) fis Riemann integrable on E if and only if the set of points of discontinuity of 
fon E is of measure zero. 
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ii) Suppose that V is an open set in R" such that E C V, and that @ : V — R" 
is 1-1 and $7' is C! on @(V) with Ag-! #0. [f f is integrable on $(E), then 
f oo is integrable on E. 


Proof. i) This part can be verified by modifying the proof of Theorem 9.69 
(see Spivak [12], p. 53). 

ii) By part i) and Theorem 12.10, it suffices to show that the set of points 
of discontinuity of f o @ on E is a set of measure zero. Let e > 0. Since f 
is integrable on @(£), its set of points of discontinuity, D, can be covered by 
cubes Q, such that 77°, |Qx| < . Set W = @ | and apply (2), with in place 
of @, to choose an absolute constant C and cubes oF such that W(Qx) © oY 


and |Q¥| < C|Q,|. Then {Q¥} covers y(D) = 7!(D) and 


Ye 2C ¥ od ece. 


k=1 k=1 


Hence, @ '(D) := w(D) is a set of measure zero. But since D is the set of 

points of discontinuity of f on @(E), @~'(D) is the set of points of disconti- 

nuity of f og on E. Hence f o @ is Riemann integrable by part i). a 
EXERCISES 


12.2.1. Using Exercise 1.4.44, compute the upper and lower sums 
U(f: Gn); Lf, Gn) for m € N, where f(x, y) = xy and G,, is deter- 
mined by 


PAGy JH Ak 2? ve 0, ly pay 
for j = 1, 2. Prove that 


Rue US, Gm) = L(f, Gm) = 0. 


12.2.2. Let E be a Jordan region in R” with E C [0,1] x --- x [0, 1]. If f gare 
integrable on E with 


[fava and [eav=-1. 
E E 


and if g(x) < f(x) for all x € E, prove that for each j ¢€ {1,2,...,n} 
there is a0 < t; < 2 such that 


/ x7 (f(®) — g(x) dx = 1). 
E 


12.2.3 |. This exercise is used in Sections 12.4, 13.5, and 13.6. Let E be an open 
Jordan region in R” and xp € E. If f : E — Ris integrable on E and 
continuous at xo, prove that 


12.2.4. 


12.2.5. 


12.2.6. 


12.2.7. 


12.2.8. 


12.2.9. 


12.2.10. 
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f dV = f (Xo). 


| ceegererceee 
r>0+ Vol(B;(X0)) JB, (x) 


a) Suppose that E is a Jordan region in R” and that f, : E — R are 
integrable on E fork € N. If f, — f uniformly on E ask > ~, 
prove that f is integrable on FE and 


tim, ff Fits) ax= | fo dx. 
kK>o JE E 


b) Prove that 


lim II cos(x/k)e’/* dA 
kK>oJJE 


exists, and find its value for any Jordan region E in R’. 
If Eo C E are Jordan regions in R” and f : E — Ris integrable on E, 
prove that f is integrable on Eo. 
Let H be a closed, connected, nonempty Jordan region and suppose 
that f : H — Ris continuous. If g : H — Ris integrable and nonneg- 
ative on H, prove that there is an xp € H such that 


F (xo) i; ae / ejeerax 
H H 


Suppose that OQ := {(x,y) € R: x > Oand y > O} and that f isa 
continuous function on R* whose first-order partial derivative satisfies 
lfxl < 1. If 


1 
FG 5)2= aff an (fe 9~ FO AH. W 


for (x, y) € Q, prove that F is bounded on Q. 

[Hint: You may use polar coordinates to change variables in F.] 
Suppose that E is a Jordan region in R” and that f,g : E > Ris 
integrable on E. 


a) Modifying the proof of Corollary 5.23, prove that fg is integrable 
on E. 
b) Prove that f v g and f A g are integrable on E (see Exercise 3.1.8). 


Suppose that V is open in R” and that f : V > Ris continuous. Prove 


that if 
[fav=o 
E 


for all nonempty Jordan regions E Cc V, then f =OonV. 
Suppose that £ is a Jordan region and that f : E — Ris integrable. 
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a) If f(E) C H, for some compact set H, and ¢ : H —> Ris continu- 
ous, prove that ¢ o f is integrable on E. 
*b) Show that part a) is false if # has even one point of discontinuity. 


12.2.11. Prove the following special case of Theorem 12.291. Suppose that E 
and Eg are a Jordan regions in R”, and that f : E — Ris bounded. If 
f is continuous on E \ Eo, then f is integrable on E. 


12.3 ITERATED INTEGRALS 


If f(x1,...,Xk,.++,Xj,-++5%Xn) is defined for xz € [c,d] and x; € [a,b], j #k, 
then we shall call 


d rb d b 
[Of tenn dx; dn [ (/ Friese) dx) dy 


an iterated integral, when the integrals on the right side exist. In a similar way, 
we define higher-order iterated integrals. 

In the preceding section we defined the Riemann integral of a multivariable 
function but developed no practical way to evaluate it. In this section we show 
that, for a large collection of Jordan regions E, integrals over E can be evaluated 
using iterated integrals. 

For simplicity, we begin with the two-dimensional case. Recall that if @ : 
[a,b] — Ris bounded, then the upper Riemann integral, (U) - v o(x) dx, and 


the lower Riemann integral, (L) he ha (x) dx, both exist and are finite. 


12.30 Lemma. 
Let R = [a,b] x [c, d] be a two-dimensional rectangle and suppose that f : 
R => Ris bounded. If f (x, -) is integrable on [c, d] for each x € [a, b], then 


(L) [[ faa < wf (ci dx 
< wf (ec) aoe w) ff faa. 


Proof. Let Rj; = [xi-1, xi] x [yj-1. yj], where {xo,...,x,} is a partition 
of [a,b] and {yo,..., ye} is a partition of [c,d]. Then G = {Rij : i = 
1,2,...,k, 7 =1,2..., €} isa grid on R. 

Let ¢ > 0, choose G so that 


(22) 


U(f.G)—© < WU) II. fda, (23) 


and set 


Mj = sup f(x,y). (24) 
(x, y)E Ri; 
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Since (U) J” p(x) dx = Df) f(x) dx and 


b b b 
wf (P(x) + W(x) dx < wf p(x) ax+(W) f W(x) dx 


for any bounded functions ¢ and w defined on [a, b] (see Exercise 5.1.7), we 
can write 


b d 
w) | ( if f(s. yidy) x= YW) [ (2 naa fo, ve) dx 
‘ £ Yj 
Da vf ( fs.» dx 
== Yj-1 
k L 
aes MijQG — 4 -D OQ; — yj-1) = UF). 


It follows from (23) that 


b d 
wf (/ flr yndy)ax <0) ff faare, 
a Cc R 


Taking the limit of this inequality as e — 0, we obtain 


wf (f'remay)arsw ff raa. 
Of (frena)arew ff raa. | 


We are now prepared to show that, under reasonable conditions, a double 
integral over a rectangle reduces to an iterated integral. 


Similarly, 


12.31 Theorem. [FUBINI’S THEOREM]. 

Let R = [a, b] x [c,d] be a two-dimensional rectangle and let f : R > R. 
Suppose that f (x, -) is integrable on [c, d] for each x € [a, b], that f(-, y) is in- 
tegrable on [a, b] for each y € [c, d], and that f is integrable on R (as a function 
of two variables). Then 


b d d b 
[[rea=[of[ temaras= [of pes avay. (25) 


NOTE: These hypotheses hold if f is continuous on the rectangle [a, b] x [c, d]. 
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Proof. For each x € [a, b], set g(x) = Vk f(, y) dy. Since f is integrable on 
R, Lemma 12.30 implies that 


b b 
[[faa-wf g(x) dx =(L) [ 8 (x) dx. 


Hence, g is integrable on [a, b] and the first identity in (25) holds. Reversing 
the roles of x and y, we obtain 


ff saa= fo [i pen avay. 


Hence, the second identity in (25) holds. a 


The second identity in Fubini’s Theorem is as important as the first. It tells 
us that, under certain conditions, the order of integration in an iterated integral 
can be reversed. Frequently, one of these iterated integrals is easier to evaluate 
than the other. 

12.32 EXAMPLE. 


Find 


1 pl ; 
if i yer dy dx. 
0 40 


Solution. This iterated integral looks tough to integrate. However, if we change 
the order of integration, using Fubini’s Theorem, and substitute u = y?, we 


obtain 
1 pl 1 
E —2 
/ / yer dx dy = y(e” —l1)dy= ates (a 
0 JO 0 2 


The following three remarks show that the hypotheses of Fubini’s Theorem 
cannot be relaxed. First, we show that existence of both iterated integrals is 
not enough to apply Fubini. You must also verify that f is integrable in the 
two-dimensional sense. 


12.33 Remark. There exists a function f :R* — R such that f(x, +) and f(-, y) 
are both integrable on [0, 1], but the iterated integrals are not equal. 


Proof. Set 


Q2n Gye, Or Oy 0-82 ae N: 
f(x,y) = 4—-27"7*1 (x, y) € [2-7-1 27”) x [2-7 2+), n EN, 
0 otherwise. 
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Notice that for each fixed yo € [0,1), f(x, yo) takes on only two nonzero 
values and is integrable on [0, 1) in x. For example, if yo € [2~",27-"*'), 
then f(x, yo) = 27” for x © [27-",27-"+!), and f(x, yo) = —2?"t! for x € 
[2-"-!, 2-"); hence, f(x, yo) is bounded on [0, 1), and 
g-ntl Q-n 


1 
/ f (x, yo) dx =) aE -{ Q7ntl dy = 2" —2" =0, (26) 
0 Q-n Q-n-1 


The same is true for f(xo,y) when xo ¢€ ([0,1/2), but when x € 
[1/2, 1), f(xo, y) takes on only one nonzero value—namely, f(xo, y) = 4 
when y € [1/2, 1), and equals zero otherwise. It follows that 


i “pd ipl 
if i, fx. y)dy dx = f / 4 dy dx =1. 
0 Jo 1/2 J1/2 


On the other hand, by (26) we have 


1 pl 
/ / f(x, y) dx dy = 0. 
0 JO 


Thus the iterated integrals of f are not equal. | 
The function in Remark 12.33 is not bounded. The following example shows 
that even when f is bounded, existence of the iterated integrals is not enough 


to conclude that f is integrable in the two-dimensional sense. 


12.34 Remark. There exists a bounded function f : R? —> R such that f (x, -) 
and f (-, y) are both integrable on [0, 1], but fis not integrable on [0, 1] x [0, 1]. 


Proof. Set 


1 wy =(4, 5), O<p,q <2",neN, 


0 otherwise. 


few 


Notice that if x9 = p/2”, then f(xo, y) = 1 only when y = g/2” for some 
q =1,2,...,2” — 1. Hence, for each fixed x9 € [0, 1], f(xo, y) = 0 except for 
finitely many y’s. It follows from Exercise 5.1.6 that 


1 
i fasvdy=0 


for all x € [0, 1]. A similar statement holds for the dx integral. Consequently, 


1 1 1 1 
/ / fix. y)dy dx = f / f(x, y) dx dy = 0. 
0 JO 0 JO 
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To see that the double integral of f does not exist, let Rj := [a, b] x [c, d] 
be a nondegenerate rectangle in [0, 1] x [0, 1]. It is easy to verify that [a, b] 
and [c, d] both contain irrational points and points of the form p/2” (just use 
density of irrationals, and repeat the proof of Theorem 1.18 with 2” in place 
of n). Thus if G = {R;} is a grid on [0,1] x [0,1], then Mj(f) = 1 and 
mj(f) = 0 for all j, and U(f,G) — L(f,g) = 1—0 = 1. Hence, f is not 
integrable on [0, 1] x [0, 1]. a 


Thus we cannot assume that a function of several variables is integrable just 
because its iterated integrals exist and are equal. (See also Exercises 12.3.5 
and 12.3.9.) 

The next result is starred because it uses Lebesgue’s characterization of 
Riemann integrability (see Theorems 9.69 and 12.291). 


*12.35 Remark. There exists a function f : R* > R such that f is integrable on 
[0, 1] x [0,1], fC, y) is integrable on [0, 1] for all y € [0,1], but f(x, -) is not 
integrable on [0, 1] for infinitely many x € [0, 1]. 


Proof. Let 


FEE 0 when x = 0 or when x or y is irrational 
— 1/q when x, y € Qand x = p/gq is in reduced form. 


By the argument of Example 3.33, the function f is continuous and zero on 
the set (0, 1] \ Q) x [0, 1]. Hence, by Lebesgue’s Theorem, f is integrable 
on the square R = [0, 1] x [0, 1]. By computing its lower sums, we find that 
Ife fdA =0. 

Similarly, for each y e€ [0,1], f(,y) is integrable on [0, 1] with 
His f (x, y)dx = 0. Thus 


[ ( [50.0 ix) w= ff saa=o 


On the other hand, since for each nonzero x € Q the function f(x, -) is 
nowhere continuous, it cannot be integrable on [0, 1]. Therefore, the other 
iterated integral in Fubini’s Theorem does not exist. a 


Fubini’s Theorem shows us how to evaluate a double integral over a rectangle 
by means of iterated integrals. The following result shows that the integral of 
a continuous function over a rectangle in R” can be evaluated using n partial 
integrals. 


12.36 Lemma. 

Let R = [a,, b1|]x---x[dy, by] be an n-dimensional rectangle and let f : R—> R 
be integrable on R. If, for each x := (x1,...,Xn-1) € Rn := [a1,b1] x +--+ x 
[an—1, bn-1], the function f (x, -) is integrable on (dy, by], then 
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bn 
/ t,t) dt 
an 


is integrable on Ry, and 
Dn 
i f(, t) d(x, t) = / F(x, t) dt dx. (27) 
R Rn Jan 
Proof. By repeating the argument of Lemma 12.30, we have 


bn 
w | f(x, t) d(x,t) < w | J (x, t) dt dx 
R R 


nv an 


bn 
< w) | f(x, t) dt dx 
Ry Jan 
< wf J (x, t) d(x, t) 
R 


for any bounded f. Since f is integrable on R, it follows that (27) holds. lf 


Using this result in conjunction with Remark 12.19, we can evaluate integrals 
over a large collection of nonrectangular Jordan regions. To this end, we shall 
call a nonempty set E C R” a projectable region if and only if there is a closed 
Jordan region H C R"~!, an index j ¢€ {1,...,n}, and continuous functions 
o, wv: H — Rsuch that 


ES{GiacyQe Rs Gig hi ie) ea 


and PIA ervey ery cae te) SX; < AC charmer Seep. 20 |e 
[The notation x; means the variable x; is missing; hence, (x1,...,Xj,...,%n) is 


a point in R’~!.] In this case, we say that E is generated by j, H, ¢, and . 

We are more specific for regions in R* and R®. A set E C R’ is called a region 
of type lif and only if E = {(x, y): x € [a,b], 6) < y < W(x)} and a region 
of type IT if and only if E = {(x, y): y € [a,b], (vy) < x < W(y)}, where ¢, wv: 
[a, b] > Rare continuous functions. Similarly, a set E C R? is called a region of 
type Tif and only if E = {(x, y,z): («, y) € A, d(, y) <z < Ww, y)}, a region 
of type ITif and only if E = {(x, y,z): (4,2) € A, 0,2) < y< Ww, gz}, anda 
region of type IITif and only if E = {(x, y,z): 0,2) € A, 6,2) <x < v0.2}, 
WEEtE o,w : H — Rare continuous functions and Z is a closed Jordan region 
in R*. 


12.37 EXAMPLE. 


Prove that the set E in R* bounded by y = x and y = x? is a region of types I 
and IT. 
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Proof. The set E can be described by 


{, y)i:x*°<y<x,x€[0,1} or {@, y):y<x</y, y €[0, 1} 


(see Figure 12.6). a 


Ay 


HY 


FIGURE 12.6 


12.38 EXAMPLE. 


Prove that the set E of points (x, y, z) which satisfy 4x? + y? +z? < lis aregion 
of types I, II, and HI. 


Proof. The set E, an ellipsoid, can be described by 


E={(x,y,2):—\/1—4x2 — y? <z<,/1—4x? — y”, (x, y) € A}, 


where H = {(x, y) : 4x? + y* < 1}. A similar argument shows that E is of 
types II and II. fe 


Before we show how to evaluate multiple integrals over projectable regions, 
we introduce additional terminology. For each k = 1,...,n the set 


My, = {x € RK’: x, =0} 


will be called a coordinate hyperplane. Given a set E C R", the pro- 
jection of E onto the coordinate hyperplane IIx, is the set E, of points 
(X1,.--, Xk-1, 0, Xe41,---, Xn) Such that (x1,..., x4, ...,%,) € E forsome x, € R. 
For example, in R’ the coordinate hyperplane I]; corresponds to the yz-plane, 
and the projection of the three-dimensional ball B,(xo, yo, zo) onto IT; is essen- 
tially the two-dimensional ball B,(yo, zo) (see Figure 12.7). 

The following result shows that multiple integrals over most projectable 
regions can be evaluated using iterated integrals. 
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FIGURE 12.7 


12.39 Theorem. Let E be a projectable region in R" generated by j, H, o, and w. 
Then E is a Jordan region in R". Moreover, if f : E — Ris continuous on E, then 


W(X] 0) Xj ye Xn) = 
[ fous | / f (1, ~.+,%n) dxj | d(x, ...,Xj,...,%n). 
E H (X14... ¥j,..-Xn) 


(28) 


Proof. By symmetry, we may suppose that j =n. Thus 
E=({(x,t):x=(11,...,%n-1)€H and (x) <t< Ww}. 


To show that £ is a Jordan region, we must show that the volume of 0 E£ is zero. 
Now dE is made up of “lower-dimensional pieces,” a bottom B = {(x, ft): xe H 
and t = $(x)}, atop T = {(x,t): x € H andt = w(x)}, and aside S = {(x,f): 
x € 0H and ¢(x) < t < wW(x)}. (Figure 12.8 illustrates the situation for the 
case n = 3.) Hence, we must show that B, T, and S are of volume zero. 

To estimate the volume of B, notice that since H is compact, ¢ is uniformly 
continuous on H. Thus, given e > 0, there is a 6 > 0 such that 


x,y¢€H and ||x—yl|<6 imply |¢(x) — ¢(y)| <e. (29) 


Since H is bounded, H is contained in some (n — 1)-dimensional cube Q. 
Divide Q into subcubes Q|,..., Q» such that x, y € Q x implies ||x — y|| < 6, 
and let R, = Ox x [b(ax) — 2, b (ax) + 2e] for some ag € Ox, K=1,2,..., p. 
Then G := {R},..., Rp} is grid in R”, and, by (29), 


P P 
V(B;G) <)> [Rel =4e } 1Qxl = 4e1 Ql. 


k=1 k=1 
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FIGURE 12.8 


It follows from Theorem 12.4 that B is of volume zero. A similar argument 
shows that T is of volume zero. 
To estimate the volume of S, set 


M = sup w(x) and m = inf (x). 
xeH xeH 


Since H is a Jordan region, choose a grid {Q),..., Qp} in R"—! such that 


Yo 1Ol<e. 


OLNIH AD 
Set Ry = Qx x [m, M] and observe that G := {R1,..., Rp} is a grid in R”, and 


P 
V(S;G) <)> [Rel < (M-mode. 


k=1 


Hence it follows from Theorem 12.4 that S is of volume zero. We conclude 
that dE is of volume zero (i.e., E is a Jordan region). 

To prove (28), let R = [a1, b1] x --- x [an, bn] be an n-dimensional rectangle 
which contains EF, and define g on R by g(x, t) = f(x, tf) when (x, f) € EF, and 
g(x, t) = 0 otherwise. By Remark 12.19 and Lemma 12.36, 


by by 
[ fonaan= af B(X1,---,Xy) dX,... dx, 
E ay an 
b 


=) (/ "g(x, t) ar) dx. 
H an 
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But for each x = (x1,..., %,»_1) € H, we have 
f&t1) oo) <t< p(x) 
t)= 
8(% 1) i otherwise. 
Therefore, 
bn W(x) 
/ g(x, t)dt= / Ft (x, t) dt. a 
Qn p(x) 


Although we have stated Theorem 12.39 for continuous /, the result is evi- 
dently true whenever Lemma 12.36 applies; for example, if f is integrable on E 
and f(x, -) is integrable on [ay, b,] for each fixed x € H. 

If the set H is itself projectable, then Theorem 12.39 can be applied again 
to H. Thus if E is nice enough, an integral over E can be evaluated using n 
partial integrals. We close this section with several examples which illustrate 
this principle for the cases n = 2 andn = 3. 


12.40 EXAMPLE. 


Find the integral of f(x, y, z) = x over the region EF bounded by z = 1—x— y, 
x =0, y=0,andz=0. 


Solution. The surfaces z = 0 and z = 1 — x — y intersect when y = | — x. The 
projection £3 is bounded by the curves x = 0, y = 0, and y = 1 — x. These last 
two curves intersect when x = 1. Thus E is a region of type I: E = {(x, y,z): 
O<x<1,0<y<1-—x, 0<z<1-—x-— y} (see Figure 12.9). It follows that 


1 1—x 1—x-y 
ip tars] J / x dz dy dx 
E 0 JO 0 
1 1—x 
=i / (x — x” — xy) dy dx 
0 JO 


Me ie oo ee 1 
=5 | @-2 tx )dx= 7. | 


12.41 EXAMPLE. 
Find the integral of f(x, y, z) = x* over the region E bounded by |x| = 1, z = 
x? — y?, where z > 0. 


Solution. The surfaces z = 0 and z = x” — y” intersect when x? — y? = 0 (ie., 
y = +x). The curves y = +x and |x| = | intersect when x = +1. Thus the 
region FE is of type I: 


E={(x,y,z):-l <x <1,-|x| sy < |x|, 0572 <x?-y?} 
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FIGURE 12.9 


(see Figure 12.10). It follows that 


1 pixl px?—y? 
[{fsav=f i x? dz dy dx 
E —1 J—|x| JO 
1 pix 
a / / Ge y?)x? dy dx 
-1J—|x| 


1 x 8 1 A 
=a / (2 = ay ax = 5 | x dx =~. | 
0 JO 3 Jo 2 


Although Theorem 12.39 can be used in conjunction with Theorem 12.23 to 
handle the case when F is a finite union of projectable subregions, we can some- 
times avoid breaking E into subregions by changing our point of view. Here is a 
typical example. 


FIGURE 12.10 
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FIGURE 12.11 


12.42 EXAMPLE. 


Find the integral of f(x, y, z) = x —z over the region bounded by z = y”, z = 1, 
z=x,andx =0. 


Solution. The region E is a union of two regions of type I (see Figure 12.11, 
where the “back” of E is that portion of the plane x = 0 which is bounded by the 
parabola z = y”, x = 0 here represented by a dashed line). Therefore, we must 
use two integrals if we integrate dz first: the integral where z varies between y* 
and 1, and the integral where z varies from x to 1. It looks complicated to set 
up. The solution is simpler if we integrate dx first. Indeed, E is a single region 
of type III since 


PH=(G@o2e-le ee hy S221, 0 S722) 


Thus 


1 1 z 
[f{f rav=f [, [ @-aavacay 
E —1 y? (0) 


1 1 1 
--5/ [eady== | Con en a 
2 —1 y2 6 =| 7 


EXERCISES 
12.3.1. Evaluate each of the following iterated integrals. 


a) 7 i xy dxdy 
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12.3.2. 


12.3.3. 


12.3.4. 


12.3.5. 


12.3.6. 


Integration on R” 


b) fo fo xy —* dxdy 
c) fo Jo 2y sin(xy) cos(xy) dydx 


Evaluate each of the following iterated integrals. Write each as an 
integral over a region E, and sketch E in each case. 


a) fi fax +3) dydx 
b) Ws i cos(x*) dxdy 
: 2 
) fo Lilo” * dedxdy 
2 
d) fo Sy St Vx? +2 dzdxdy 


For each of the following, evaluate f,, f dV. 


a) f(x, y) = Gx?+1)7! and E is bounded by x = 1, y = 0, and y = x. 

b) f(x, y) = 2x + y? and E is the triangle with vertices (0, 0), (0, 2), 
and (2, 0). 

c) f(x,y) = 6x4e2"") and E is the triangle with vertices (0, 0), (1, 0), 
and (1, 1). 

d) f(x, y,z) = 1 and E is the set of points (x, y, z) such that 0 < z < 
1-—x,0<y <x*42xz+77,andx >0. 

Compute the volume of each of the following regions. 


a) E is bounded by the surfaces x + y+ z = 3, z=0,andx*+y? =1. 

b) E lies under the plane z = x+y and over the region in the xy-plane 
bounded by the curves x = ./y/2, x =2,/y, x+y =3. 

c) E is bounded by z =y,x=y+27,x=0,z=1. 

d) E is bounded by y = x3, x = 2”, z= x’, and y =0. 


a) Verify that the hypotheses of Fubini’s Theorem hold when f is con- 
tinuous on R. 

b) Modify the proof of Remark 12.33 to show that Fubini’s Theorem 
might not hold for a nonintegrable f, even if f(x, y) is continuous 
in each variable separately; that is, if f(x, -) is continuous for each 
x € [a, b] and f(-, y) is continuous for each y € [c, d]. 


a) Suppose that f; is integrable on [az, by] for k = 1,...,n, and set 
R = [a,, bj] x -++ X [ay, by]. Prove that 


Jf frend. falin) dG, 0 


by bn 
= ( fi) an)...{f fn@n) din) 


b) If Q = [0, 1]” andy := (1, 1,..., 1), prove that 


| ere exd\ 
O e J. 


12.3.7. 


12.3.8. 


12.3.9. 
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The greatest integer in a real number x is the integer [x] := n which 
satisfies n < x <n+1. An interval [a, b] is called Z-asymmetric if 
b+aZ[b]+[a]+1. 

a) Suppose that R is a two-dimensional Z-asymmetric rectangle (i.e., 
that both of its sides are Z-asymmetric). If w(x, y) := (« — [x] — 
1/2)(y — [y] — 1/2), prove that [{, wdA = 0 if and only if at least 
one side of R has integer length. 

Suppose that R is tiled by rectangles R;..., Ry (i.e., that the R;’s 
are Z-asymmetric, nonoverlapping, and that R = UR j)- Prove 
that if each R; has at least one side of integer length and R is 
Z- asymmetric, then RF has at least one side of integer length. 


b 


wa 


Let E be a nonempty Jordan region in R* and f : E — [0,00) be 
integrable on E. Prove that the volume of Q = {(x, y,z): (x, y) € E, 
0<z< f(x, y)} (as given by Definition 12.5) satisfies 


voi(%) = ff fda. 
E 


Let R = [a, b] x [c, d] be a two-dimensional rectangle and f: R> R 
be bounded. 


a) Prove that 


b d 
(L) [ff faa < w | (co f 46) dy) re 
b d 
<(U) / G / AES ay) re 
=<) ff fda 
R 
for X =UorXx = L. 


b) Prove that if f is integrable on R, then 


[f raa=[ (© free) re 
-{ (w [ren ay) dx. 


c) Compute the two iterated integrals in part b) for 


yeEQ 
y¢Q 


and R = [0, 1] x [0, 1]. Prove that f is not integrable on R. 


1 
row =| 
XxX 
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*12.3.10. [FUBINI’S THEOREM FOR IMPROPER INTEGRALS]. If a < b are extended 
real numbers, c < d are finite real numbers, f : (a, b) x [c,d] > Ris 
continuous, and 


b 
Fo) = | f(x, y) dx 


converges uniformly on [c, d], prove that 


d 
i f (x, y) dy 


is improperly integrable on (a, b) and 


d b b d 
[ [ tonaay= ff pe.nayae. 


12.4 CHANGE OF VARIABLES 


Recall (Theorem 5.34) that if @ : [a,b] — R is continuously differentiable and 
¢’ £0 0n [a, b], then 


/ f(t) dt = i F(O(x)) 1'@)| dx 
¢([a,b]) [a,b] 


for all f integrable on ¢({a, b]). We shall generalize this result to functions of 
several variables; namely, we shall identify conditions under which 


f(u) au= | S (G(x) Ags) dx (30) 
G(E) E 


holds. (At this point you may wish to read the discussion following the proof 
of Theorem 12.46 below to see that Ag takes on a familiar form when ¢ is the 
change from polar to rectangular coordinates.) 

It takes six or seven hypotheses to establish (30). These hypotheses fall into 
two categories: 


1) Hypotheses made so the change of variables is possible. Since the one- 
dimensional result required ¢ to be continuously differentiable and ¢’ 4 0 
(which together imply that ¢ is 1-1), we expect the corresponding hypotheses 
for (30) to be as follows: @ is 1-1, continuously differentiable, and Ag # 0. 

2) Hypotheses made so the integrals in (30) exist. There are four of these: E is a 
Jordan region, $(£) is a Jordan region, f is integrable on $(£), and fog|Ag| 
is integrable on E. In practice, only the first and third of the hypotheses in 
category 2) need be verified. Indeed, if @ satisfies all hypotheses in cate- 
gory 1) and E is Jordan, then $(£) is Jordan by Theorem 12.10, and, when 
f is integrable on $(E£), f o p|Ag| is integrable on E (see Theorem 12.2911 
and Exercise 12.2.8a). Moreover, the remaining hypotheses in category 2) 
can usually be verified by inspection. The reason for this is twofold. E 
is frequently projectable, hence a Jordan region, and most functions are 
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continuous (or nearly so), hence integrable on FE. Therefore, the crucial 
hypotheses for (30) are those in category 1), namely, that @ be 1-1, 
continuously differentiable, and Ag 4 0. 


To give an outline of a proof of (30), we introduce the following terminology. A 
function f is said to satisfy a certain property P “locally” on a set E if and only if 
given a € E there is an open set W containing a such that f satisfies P on WN E. 
f is said to satisfy the property P “globally” on E if and only if f satisfies P for 
all points in E. To prove (30), we first obtain several preliminary results which 
culminate in a “local” change-of-variables formula (see Lemma 12.45) and then 
use this to obtain a “global” change-of-variables formula for functions @ which 
are C! on an open set which contains E (see Theorem 12.46). Throughout this 
discussion, we assume that Ag is never zero. In Section 12.5, we work much 
harder to show that the condition “Ag 4 0” can be relaxed on a set of volume 
zero (see Theorem 12.65). 

Since every Jordan region can be approximated by rectangles, and every 
integrable function is almost continuous, hence locally nearly constant, we 
should consider (30) first in the case when @(£) is a rectangle and f is iden- 
tically 1; that is, we should prove that 


|R| =a |Ag(x)| dx. (31) 
$'(R) 
Our first preliminary result shows that this case is a step in the right direction. 


12.43 Lemma. 

Let W be open in R", let 6 : W — R" be I-1 and continuously differentiable 
on W with Ag 4 0 on W, and suppose that ¢ | is continuously differentiable 
on @(W) with Ag-1 #Oon @(W). Suppose further that (31) holds for every 


n-dimensional rectangle R C @(W). If E is a Jordan region with E C W, if f is 
integrable on $(E), and if f o @ is integrable on E, then 


f(u) du = / (f 0 b)(x)|Ag(x)| dx. 
GE) E 
Proof. We may suppose that W is nonempty. Let E be a fixed Jordan region 
which satisfies E C W and suppose that f is integrable on (E). Set ft = 
(f|+f)/2and f~ = (|f|—f)/2. Then f* and f~ are both nonnegative and 
integrable on $(E), and f = f* — f~ (see Exercises 3.1.7 and 5.2.2). Since 
the integral of a difference is the difference of the integrals, it suffices to prove 
the lemma for the case when f > 0. 

Let ¢ > 0. Since f is integrable on $(E), choose a grid G = {Rj,..., Rp} 
such that 


f)du>UfG)-e:= > Mj\Ril—e, (32) 
(E) RiNG(E)AO 


where M; := sup f(Rj) = f(O(@'(Rj))). Moreover, since $(E) = @(E) C 
¢@(W), we may suppose, by refining G if necessary, that Rj 1¢(E) ¢ ¥ implies 
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R; C 6(W). Hence, by Lemma 12.9, {o | (Rp}R neo is a nonoverlapping 
collection of Jordan regions whose union satisfies 


a= (J oR) 2¢'@4)=E 
RjNO(E) AO 


Hence, (32), (31), and Theorems 12.25 and 12.23 imply that 


[ft du=> \  M\IRjl—-e 


RjNG(E) #9 

= es Mi fv, |Ag(x)| dx —e 
RjNG(E) FD 

> Rie f(P(x)|Ag(x)| dx —€ 
Rng(E\Z0°% (Ri) 


=f FOO) Ag) dx —¢ 
> i F@™)lAg@| dx—e. 


(For this last step, we used the fact that f > 0.) Since ¢ > 0 was arbitrary, we 


obtain 
flu) du> | f()|d90| dx. 
G(E) E 
On the other hand, by Theorem 12.20 there is a grid H = {Q,..., Op} 
such that 


fu)du<s > mj\Qj\+e. 
eur Q;C@(E))” 


where mj; := inf f(Q;) = f (o(o"! (Q;))). Repeating the steps above with 


M= LY 10) co GE) =E 
Qj C(H(E))? 


in place of Q), we see that 


fu)du<s > mj|Qj\+e 
ida Qj) C@E))? 


< / F@®)IAg)| dx-+s 
Qg 


< i: f(@(x))|Ag(®)| dx + €. 
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We conclude that 
[rem au= f reoeiagoo| ax B 
(EL) E 


Next, we show that (31) holds locally near points a when Ag(a) # 0 and ¢ is 
1-1 and C!. 


12.44 Lemma. 

Let V be open in R", and @ : V — R" be 1-1 and continuously differentiable 
on V. If Ag(a) 4 0 for some a € V, then there exists an open rectangle W such 
thata € W CV, Ag is nonzero on W, ¢~' is C! and its Jacobian is nonzero on 


@(W), and such that if R is an n-dimensional rectangle contained in @(W), then 
@~ '(R) is Jordan and (31) holds. 


Proof. The proof is by induction on n. If n = 1 and ¢’(a) # 0, then ¢’ is 
nonzero on some open interval / containing a. Hence, by Theorem 5.34, (31) 
holds for “rectangles” (i.e., intervals) in #(/). 

Suppose that (31) holds on R’~!, for some n > 1. Let @: V > R"” be 1-1 
and C! on V with Ag(a) 4 0. Expanding the determinant of A := D@(a) along 
the first row, it is clear that at least one of its minors, say Aj;, has nonzero 
determinant; that is, that w(x) := (@2(x),..., @j(%), xj,.-., bn(X)) also has 
nonzero Jacobian at x = a. Since ¢ and yw are both C! on V, it follows from 
the Inverse Function Theorem that there is an open set W C V, containing a, 
such that @ and wy are 1-1 on W, Ag and Ay are nonzero on W, and ¢ | is 
1-1, C! and A g-| #0on @(W). By making W smaller, if necessary, we may 
suppose that W is an open rectangle; that is, that there exist open intervals /; 
such that W =I, x--- x In. 

Assume for simplicity that 7 = 1; that is, that w(x) = (x1, @2(x),..., On (X)). 
For each x = (x1,...,%) € W(W) set o(x) = (b1(W7!(x)), x2, ...,Xp). It is 
clear that 6 = o o y, hence by the Chain Rule, Ag(x) = Ag (W(x)) Ay (x). In 
particular, by the choice of W, 


Ay(x) #0 and Ag(W(x)) 40 forallxe W. (33) 


To show that the inductive hypothesis can be used on y, fix t € I. Set 
Wo = Ib x--- x I, and @'(y) = (@o(t, y),.--, On(t, y)) for each y € Wo. Then 
¢' : Wo > R"~! is 1-1 and C! on Wo, and, by (33), Agi(y) = Ay(t.y) 4 0 
for all y € Wo. It follows from the inductive hypothesis that if Qo is an 
(n — 1)-dimensional rectangle which satisfies Qy C 6’ (Wo), then (@’)~!(Qo) is 
Jordan and 


|Qo| = / |Ag:(y)| dy. (34) 
(G')1(Qo) 


(Wo, hence, W, may have gotten smaller again.) 
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Let Q = Ip x Qo be any n-dimensional rectangle in w(W) and integrate 
(34) with respect to t over Jp to verify 


\1=\nol-1001= ff [Agi (y)| dy dt. 
In J(@')“!(Qo) 


But the first component of p satisfies W(t, y) = t for ally ¢ W, so Ag = Ay 


and wv '(Q) is the union of the “t-sections” (@’)~!(Qo) as t ranges over Ip. 
Hence, we can continue the identity above as follows: 


|Q| = / Ag cnlay = [ |Ay(u)| du. 
Ip J ($')-!(Qo) ¥!(Q) 


In particular, it follows from Remark 10.34 that 
g(u) du = i 8(W(x))|Ay(x)| dx (35) 
WE) E 


for all Jordan regions E which satisfy E C W, provided g is integrable on 
w(W) and g o & is integrable on E. 

Similarly, we can use the inductive hypothesis to prove that (31) holds for « 
in place of ¢ for all n-dimensional rectangles R contained in @(W). Hence, for 
each such rectangle R, we have by (35)—with E = w'(o—!(R)) = @ '(R) 
and g = |A,|—and the Chain Rule that 


|R| =) |Ag(u)| du 
o!(R) 
= |Ag (W(x))| |Ay(x)| dx 
wv '(o7!(R)) 
= |Ag(x)| dx. a 
$7! (R) 


By combining Lemmas 12.43 and 12.44, we obtain the following local version 
of the change-of-variables formula we want. 


12.45 Lemma. 
Suppose that V is open in R", thata € V, and that ¢ : V > R" is continuously 
differentiable on V. If Ag(a) 4 0, then there exists an open rectangle W C V 


containing a such that if E is Jordan with E C W, if f 0 @ is integrable on E, 
and if f is integrable on @(E), then 


Fe ne / F(O(%) [Ag(x)| dx. 
o(E) E 


This local change-of-variables formula contains the following global result. 
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12.46 Theorem. Suppose that V is open in R" and that @ : V > R” is I-I and 
continuously differentiable on V. If Ag 4 0 on V, if E is a Jordan region with 


E CV, if f o@ is integrable on E, and if f is integrable on (E), then 
[fen au= [ feoern|agoo) x (38) 
o(E) E 


Proof. Let f : ¢(E) > R be integrable, and set H := E. By Lemma 12.45, 
given a € H there is an open rectangle W, such that a € W, Cc V and 


/ (ide / F(@(%)|Ag0o| dx (39) 
(Ei) Ej 


for every Jordan region E; which satisfies E; C Wa. Let Q, be an open rect- 
angle which satisfies a € Qa C Qa C Wa. Then for each a € H there is an 
r(a) > O such that B,,a)(a) C Qa. Since the Jordan region E is bounded, H is 
compact by the Heine-Borel Theorem. Thus there exist finitely many a; such 
that H is covered by Braj (aj), J = 1,2,...,N. Hence the open rectangles 


Qj := Qa, satisfy 


N 
Hc\|JQ;. 

j=l 
Let R be a huge rectangle which contains H andG = {Rj,..., Rp} be a grid on 
R so fine that each rectangle in G which intersects H is a subset of some Q;. 
(This is possible since there are only finitely many Q ;’s; just use the endpoints 
of the Q;’s to generate G.) Let Ej; = R;NE. Then E; C RINH CQ; C Wa, for 
some j € {1,..., N}; that is, (39) holds. Moreover, the collection {£),..., Ep} 
is a nonoverlapping family of nonempty Jordan regions whose union is E; 
hence, by Theorem 1.37 and Lemma 12.9, the collection {@(£;) :i = 1,..., p} 


is a nonoverlapping family of nonempty Jordan regions whose union is @(£). 
It follows from Theorem 12.23 and (39) that 


P 
f(u) du= if f(u) du 
o(E) aD (Ei) 


P 
= | f(O(X))|Ag®)| dx = [ F(O(X))|Ag(®)| dx. a 
i=l “i 


Again, we note that in Theorem 12.46 the hypothesis that f o @ be integrable 
is superfluous—see Theorem 12.29ii. 

To see how all this works out in practice, we begin with a familiar change of 
variables in R*. Recall that polar coordinates in R* have the form 


x =rcosé, y=rsing, 
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AY 
(r, 8) = (x, y) 
0 
\ 
> 
x 
FIGURE 12.12 
where r = ||(x, y)|| and @ is the angle measured counterclockwise from the 


positive x axis to the line segment L((0,0); (x, y)) (see Figure 12.12). Set 
o(r, 9) = (r cos@,r sind) and observe that 


cos@é —rsin@ 


Ag= det | sin@ rcosdé 


= r(cos’ 6 + sin? 9) =r. (40) 


Thus we abbreviate the change-of-variables formula from polar coordinates to 
rectangular coordinates by dx dy = r dr d0. 

Although ¢ is not 1-1 [e.g., 6(0, 8) = (0, 0) for all 8 € R] and its Jacobian is 
not nonzero, this does not prevent us from applying Theorem 12.46 (i.e., chang- 
ing variables from polar coordinates to rectangular coordinates and vice versa). 
Indeed, since ¢ is 1-1 on Q := {(r,0) : r > 0,0 < @ < 27} and its Jacobian 
is nonzero off the set Z := {(r,6) : r = O}, we can apply Theorem 12.46 to 
EO{ir,0) :r > O} and let r | 0. Since the end result is the same as if we 
applied Theorem 12.46 directly without this intermediate step, we shall do so 
below without any further comments. This works in part because the set Z 
where the hypotheses of category 1 fail (see the discussion following (30) above) 
is a set of volume zero (see Theorem 12.66 below). 

The next two examples show that polar coordinates can be used to evaluate 
integrals which cannot be computed easily using rectangular coordinates. 


12.47 EXAMPLE. 


Find the volume of the region E bounded by z = x? y’, eee y? = 4, and 
z=0. 


Solution. Clearly, E lies under the function f(x, y) = x? + y? over the region 
B = B2(0, 0) (see Figure 12.13). Using polar coordinates, we obtain 


20 2 
Vol(E) = II (+ y*)dA= / < r? dr dO = 8x. a 
B 0 0 
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FIGURE 12.13 


12.48 EXAMPLE. 


Evaluate 
O45 .<2 
II ee gk 
E xX 
where E = {(x, y):.a* <x7+ y* < land0 < y < x} forsome 0 <a <1. 


Solution. Changing to polar coordinates, we see that 


2 2 m/4 pl 3 i 3 pxr/4 
i 2 aa= | [apew- . i secO dé. 
E x 0 a 1cosé 3 0 


To integrate sec@, multiply and divide by sec@ + tan@. Using the change of 
variables u = sec 6 + tan, we obtain 


[seco ae = [" sec@ tan@ + sec? 4 
0 0 


sec @ + tané 
14/2 d 
=i — = log(1 + V2). 
1 


Consequently, 


2; 2 = 38 
UE ey pee a*) log + V2) 
E x 3 


Recall that cylindrical coordinates in R> have the form 


x =rcosé, y=rsin6, ZZ, 
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where r = ||(x, y, 0)|| and @ is the angle measured counterclockwise from the 
positive x axis to the line segment L((0, 0, 0); (x, y, 0)). It is easy to see that this 
change of variables is 1-1 on Q := {(r,8,z) :r > 0,0 < 6 < 2z,z € R}, and 
its Jacobian, r, is nonzero off Z := {(r,0,z) : r = O}. We shall abbreviate the 
change-of-variables formula from cylindrical coordinates to rectangular coordi- 
nates by dx dydz = rdzdrdé@. (Note that Z is a set of volume zero. As with 
polar coordinates, application of Theorem 12.46 can be justified by applying it 
first for r > 0, and then taking the limit as r | 0.) 


12.49 EXAMPLE. 
Find the volume of the region E which lies inside the paraboloid x? + y? +z = 4, 


outside the cylinder x* — 2x + y* = 0, and above the plane z = 0. 


Solution. The paraboloid z = 4 — x* — y” has vertex (0, 0, 4) and opens down- 
ward about the z-axis. The cylinder x7 — 2x + y* = (x —1)*+y*—1 = Ohas base 
centered at (1, 0) with radius 1. Hence, the projection F3 lies inside the circle 
x*+y? = 4 and outside the circle x7 + y* = 2x (see Figure 12.14). This last circle 
can be described in polar coordinates by r? = 2r cos 6, that is, r = 2cos 6. Thus 


4—r? 
vo(e) = fff rav = ff / dzdA 
E E3 J0 


m/2 p2 3/2 p2 ll 
= / = Py dr do+ f [a-P rare =. 5 
—n/2 J2c0s6 x/2 JO 2 


nV 


FIGURE 12.14 


Recall that spherical coordinates in R* have the form 
x = psingcosé, y = psing sin6, Z = Pcosg, 


where p = ||(x, y, z)||, @ is the angle measured counterclockwise from the posi- 
tive x axis to the line segment L((0, 0, 0); (x, y, 0)), and g is the angle measured 
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ater eae i 


FIGURE 12.15 


from the positive z-axis to the vector (x, y, z) (see Figure 12.15). Notice that 
this change of variables is 1-1 on {(p, 9,0): p > 0,0 <9 < 7,0 <0 < 27} 
and its Jacobian, p* sing (see Exercise 12.4.8a), is nonzero off Z := {(p, y, 0) : 
y = 0,2, p = 0}, a Jordan region of volume zero. Hence, application of The- 
orem 12.46 can justified by applying it first for o > O and 0 < » < az, and 
then taking the limit as p,g | 0 and g + z. Since the end result is the same 
as applying Theorem 12.46 directly to any projectable region in R°, we shall 
do so, without further comments, when changing variables to or from spherical 
coordinates. We shall abbreviate the change-of-variables formula from spheri- 
cal coordinates to rectangular coordinates by dx dy dz = p* sing dp dg dé. (For 
spherical coordinates in R”, see the proof of Theorem 12.70.) 


12.50 EXAMPLE. 
Find 


I ae 
Q 
where Q = B3(0, 0, 0) \ Bo(0, 0, 0). 


Solution. Using spherical coordinates, we have 


2a pr | 
If xav=| i / p sing cos 6(p* sing) dp dy dé = 0. a 
Q 0 Jo J2 


Theorem 12.46 can be used for other changes of variables besides polar, cylin- 
drical, and spherical coordinates. 


12.51 EXAMPLE. 
Find 


I sin(x + y) cos(2x — y) dA, 
E 
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where E is the region bounded by y = 2x — 1, y = 2x +3, y = —x, and 
y=—x+l. 


Solution. Let d(x, y) = (2x — y, x + y) and observe that the integral in question 
looks like the right side of (38) except the Jacobian is missing. By Cramer’s Rule, 
for each fixed u, v € R, the system u = 2x — y, v = x + y has a unique solution 
in x, y. Hence, ¢ is 1-1 on R’. It is obviously continuously differentiable, and its 
Jacobian, 


Ou, 2 -1 
Ag, y) = ~ : = det fi I =; 


is a nonzero constant. Hence, we can make adjustments to the integral in ques- 
tion so that it is precisely the right side of (38): 


1 


If sin(x + y) cos(2x — y)dA = a a f ob(x, y)Ag(x, y) d(x, y), 


where f(u,v) = cosusinv. It remains to compute the left side of (38) (i.e., to 
find what happens to E under @). 

Notice that y = 2x — 1 impliesu = 1, y = 2x +3 implies u = —3, y = —x 
implies v = 0, and y = —x + 1 implies v = 1. Thus ¢(£) = [-3, 1] x [0, 1]. 
Applying Theorem 12.46 and the preliminary step taken above, we find 


1 1 1 
II sin(x + y) cos(2x — y)dA = a i. sinvcosu du dv 
E a fe ee 


1 
= 3 (sin) + sin(3))(1 — cos(1)). a 
EXERCISES 
12.4.1. Evaluate each of the following integrals. 
2 pr/4—x2 
a) / i sin(x? + yr) dy dx 
0 JO 


1 x 
b) / i Jey —y2)? dy dx 
o Jo 
b x 
c) [ [fers aan, O0<a<b 
a 0 


12.4.2. For each of the following, find [/, f dA. 


a) f(x, y) = cos(3x? + y?) and E is the set of points satisfying x? + 
ype 1: 


12.4.3. 


12.4.4. 


12.4.5. 


12.4.6. 


12.4.7. 


Section 12.4 Change of Variables 515 


b) f(x, y) = yJ/x — 2y and E is bounded by the triangle with vertices 
(0, 0), (4, 0), and (4, 2). 


For each of the following, find ///;, f dV. 


a) f(x, y,z) =z and E is the set of points satisfying x* + y* +z* < 6 
and z>x7+y?. 

b) f(x, y,z) = e and E is the set of points satisfying x7 + y* +z < 9, 
x*+y?<Ilandz>0. 

c) f(x, y,z) = (x — y)z and E is the set of points satisfying x7 + y? + 


z<4,z> /x2+y?, andx > 0. 


a) Prove that the volume bounded by the ellipsoid 
2 2 2 
x y z 
epee 


is 4r.abc/3. 

b) Let a, b, c, d be positive numbers and r> < d?/(b? +c’). Find 
the volume of the region bounded by y? + z* = r?, x = 0, and 
ax+by+cz=d. 

c) Show that for any a > 0, the volume of the region bounded by the 
cylinders x* + z? = a? and y* + 2? =a? is 16a?/3. 


a) Compute |, ./x — yJ/x +2y dA, where E is the parallelogram 
with vertices (0, 0), (2/3, —1/3), (1, 0), (1/3, 1/3). 


b) Compute J’, /2x2 — 5xy — 3y? dA, where E is the parallelogram 
bounded by the lines y = x/3, y = (v—1)/3, y = —2x, y = 1—2x. 


c) Find 
i! / eO—DIOAD GA, 
E 


where E is the trapezoid with vertices (1, 1), (2, 2), (2, 0), (4, 0). 
d) Given fj (1 — x) f(x) dx =5, find 


1 x 
/ i fax syayae: 
0 0 


Suppose that V is nonempty and open in R” and that f: V > R” is 
continuously differentiable with Ar 4 0 on V. Prove that 


Vol(f(B; (xo))) 


Me Gay 


for every xo € V. 
Show that Vol is rotation invariant in R; that is, if @ is a rotation on 
R? (see Exercise 8.2.9) and E is a Jordan region in R’, then 


Vol(@(E)) = Vol(E). 
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12.4.8. 


12.4.9. 


12.4.10 |. 


12.4.11. 


a) Compute the Jacobian of the change of variables from spherical 
coordinates to rectangular coordinates. 
b) Assuming that Vol is translation and rotation invariant (see Exer- 


cises 12.1.7 and 12.4.7), verify the following classical formulas: the 
4 


volume of a sphere of radius r is mr, and the volume of a right 
circular cone of altitude h and radius r is mr*h/3. 
Let vj = (vj1,.-.,Ujn) € R”", j = 1,...,n, be fixed. The paral- 
lelepiped determined by the vectors v; is the set 
P(V1,.-+5 Vn) = {t1Vi +++++tnVn : tj € [0, 1]}, 


and the determinant of the v;’s is the number 


det(v),...,.Vn) = det [vjx] 


nxn” 


Prove that 
Vol(P(v1,...,Vn)) = | det(v1,..., Vn)I- 


Check this formula for n = 2 andn = 3 to see that it agrees with the 
classical formulas for the area of a parallelogram and the volume of a 
parallelepiped. 


This exercise is used in Section *12.6. 


a) Prove that the improper integral doe e-*” dx converges to a finite 
real number. 
b) Prove that if / is the value of the integral in part a), then 


N>oo 


/2 pN , 
I? = lim i er dr do. 
0 0 


c) Show that 
CO 
i ge Fae ME. 
0 2 
d) Let Q, represent the n-dimensional cube [—k, k] x --- x [—k, k]. 
Find 


3 = 2 
lim e IxI ax, 
k—>oo Or 


Let H Cc V Cc R’", with H convex and V open, and suppose that 
@:V—>R'isC!. 


a) Show that if E is a closed subset of H° and 
€n(X) := (x +h) — $(x) — Dd(x)(h),_ for x € V andh small, 


then €,(x)/||h|| > 0 uniformly on £, ash > 0. 
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b) Show that if R is a closed rectangle in H? and S := (D¢(x))7! 
exists for some x € R, then given ¢ > 0 there are constants 5 > 0 
and M > 0 and a function T (x, y) such that 


So g(x) —Sogly) =x—-y+T(x, y) 


for x, y € R, and ||T (x, y)|| < Me when ||x — y|| < 6. 

Use parts a) and b) to prove that if Ag is nonzero on V, x € H®, 
and « is sufficiently small, then there exist numbers C; > 0, which 
depend only on H, @, n, and e, anda6é > Osuch that C; > 1 
as ¢ > 0 and Vol(S o f(Q)) < C-|Q| for all cubes Q C H which 
contain x and satisfy Vol(Q) < 6. 

Use part c) and Exercise 12.4.9 to prove that if Ag is nonzero on 
V and x € H®, then given any sequence of cubes Q; which sat- 
isfy x € Q; and Vol(Q;) > Oas j — om, it is also the case that 


Vol($(Qj))/|Qj| > |Ag@)| as j > ov. 


Cc 


— 


d 


— 


*12.5 PARTITIONS OF UNITY 
This section uses results from Section 9.3. 


In this section we show that a smooth function can be broken into a sum of 
smooth functions, each of which is zero except on a small set, and use this to 
prove a global change-of-variables formula when the Jacobian is nonzero off a 
set of volume zero. This same technique can be used to prove the Fundamental 
Theorem of Calculus on manifolds (see [12], for example). 


12.52 Definition. 
Let f: R" > R. 


i) The support of f is the closure of the set of points at which f is nonzero; 
that is, 


spt f := {x e R": f(x) FO}. 


ii) A function f is said to have compact support if and only if spt f is a com- 
pact set. 


12.53 EXAMPLE. 
If 


joe )4 


then spt f = R. 
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12.54 EXAMPLE. 
If 
x € (0,1) 


x € (1,2) 
otherwise, 


fx)= 


oN Ke 


then spt f = [0, 2]. 


Since the support of a function is always closed, a function f on R” has com- 
pact support if and only if spt f is bounded (see the Heine—Borel Theorem). 

The following result shows that if two functions have compact support, then 
so does their sum (see also Exercises 12.5.1 and 12.5.2). 


12.55 Remark. /f f, g : R” — R, then 
spt (f + g) C spt f Uspt g. 
Proof. If (f + g)(x) £0, then f(x) 4 0 or g(x) 4 0. Thus 
{x eR": (f + 8)(x) £0} C {kKER": f(x) FO} U {x ER": g(x) 40}. 


Since the closure of a union equals the union of its closures (see Theorem 9.19 
or 10.31), it follows that spt (f + g) C spt f U spt g. a 


Let p € Nor p = oo. The symbol C? (R") will denote the collection of func- 
tions f : R’ > R which are C? on R" and have compact support. In particular, 
it follows from Remark 12.55 that if f; € C?(R") for j =1,..., N, then 


N 
> fi € CER"). 


j=l 


We will use this observation several times below. 

If f is analytic (a condition stronger than C®) and has compact support, then 
f is identically zero (see Exercise 12.5.3). Thus it is not at all obvious that 
Ce°(R”) contains anything but the zero function. Nevertheless, we shall show 
that Co°(R”) not only contains nonzero functions but has enough functions to 
“approximate” any compact set (see Theorem 12.58 and Exercise 12.5.6). 

First, we deal with the one-dimensional case. 


12.56 Lemma. 
For every a < b there is a function @ € C2°(R) such that $(t) > 0 fort € (a, b) 
and $(t) =0 fort € (a, b). 
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Proof. The function 


et 420 
ro={' t=0 


belongs to C°(R) and f)(0) = 0 for all j € N (see Exercise 4.4.7). Hence, 


e7'/-a)’ -1/t-b) st © (a,b) 
P(t) = i 
0 otherwise 


belongs to C°(R), satisfies @(t) > 0 for t € (a, b), and spt ¢ = [a, 5]. a 


Next, we show that there exists a nonzero C® function which is constant 
everywhere except on a small interval. 


12.57 Lemma. 
For each & > 0 there is a function wy € C®°(R) such that0 < Ww < 1onR, 
w(t) =O fort <0, and W(t) =1 fort > 6. 


Proof. By Lemma 12.56, choose ¢ € C°°(R) such that ¢(t) > 0 for t € (0, 5) 
and ¢(t) =O fort ¢ (0, 5). Set 


fo @(u) du 


yO) = . 
Jo ow du 

By the Fundamental Theorem of Calculus, y ¢€ C™(R), by construction 
0<wy <1,and 


0 t<0 
t= = | 
ve 1 t> 0. 


Finally, we use these one-dimensional C°® functions to construct nonzero 
functions in C2°(R"). 


12.58 Theorem. [C® VERSION OF URYSOHN’S LEMMA]. 

Let H be compact and nonempty, let V be open in R", and let H Cc V. Then 
there is anh € C2°(R") such that 0 < h(x) < 1 forallx € R", h(x) = 1 forall 
x € H,andspth CV. 


Proof. Let @ € C?°(R) satisfy ¢(t) > 0 fort € (—1,1) and g(t) = 0 for 
t ¢ (—1,1). For each e > O and each x € R’, let Q,(x) represent the 


n-dimensional cube 


Q.(x) = {y € R”: |y; —x;| < forall j =1,...,n}. 
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Set 
sey) = 4(=)...0(2). (44) 


€ 

and observe by Theorem 4.10 (the Product Rule) that g, is C°® on R”. By 
construction, ge(y) > 0 on R”, g-(y) > 0 for all y in the open ball B,;(0), and 
the support of g- is a subset of the cube Q,(0). In particular, g- € C2°(R”). 

We will use sums of translates of these g,’s to construct a C® function, 
supported on V, which is strictly positive on H. It is here that the compactness 
of H enters in a crucial way. 

For each x € H, choose € := €(x) > O such that O,(x) C V. Set 


hx(y) = gc(y — x), y eR’, 


and notice that hy > 0 on R"”, hy(y) > 0 for all y € B,(x), hx(y) = 0 for all 
y ¢ O,(x), and hy € C>°(R"). Since H is compact and 


Hc |) Bw), 


xeH 


choose points x; € H and positive numbers ¢; = e«(x;), j = 1,...,N, 
such that 


Hc Be, (X1) U+-- U Bey (Xn). 


Set O = Q.,(x1) U...U Oey (ky) and f = hy, +... +/Axy. Clearly, Q is 
compact, Q c V, and f isC® on R". If x ¢ Q, then x ¢ Qe, (xj) for all j; 
hence, f(x) = 0. Thus spt f C Q. If x € H, thenx € Be (xj) for some j; 
hence, f(x) > 0. It remains to flatten f so that it is identically 1 on H. This is 
where Lemma 12.57 comes in. 

Since f > 0 on the compact set H, fhas a nonzero minimum on H. Thus 
there is a 6 > 0 such that f(x) > 6 forx € H. By Lemma 12.57, choose 
w € C™(R) such that y(t) = 0 whent < 0, and w(t) = 1 whent > 6. Set 
h=wof. Clearly, h € C°°(R”), spth C Q C V,and, since f > don H, h=1 
on H. Finally, since 0 < w < 1, the same is true of h. Ez 


This result leads directly to a decomposition theorem for C® functions. 
12.59 Theorem. [C® PARTITIONS OF UNITY]. 
Let Q C R" be nonempty and let {Va}aea be an open covering of Q. Then there 
exist functions @; € C3°(R") and indices a; € A, j €N, such that the following 


properties hold. 


i) $j =O forall j €N. 


ii) spt ¢j C Vu; forall j €N. 
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iii) Y° $)(x) = 1 forall x € Q. 


j=l 


iv) If H is anonempty compact subset of Q, then there is anonempty open set 
W > H and an integer N such that ¢;(x) = 0 forall j => N andx € W. In 
particular, 


N 
> 4) =1 forallxe W. 
j=l 


Proof. For each x € Q, choose a bounded open set W(x) and an indexa € A 
such that 


x € W(x) C W(x) C Vy. 


Then W = {W(x) : x € Q} is an open covering of Q and, by Lindeldf’s Theo- 
rem, we may suppose that W is countable; that is, W = {Wj} jen. 
By construction, given j €N, there is an index a; € A such that 


Wj CW; ae Vou; 


Choose by Theorem 12.58 functions h; € CO°(R”) such that 0 < h; < lon 
R’, hj =1 on Wj, and spt h; C Va; for 7 € N. Set ¢; = h, and for j > 1, set 


oj) =(—h)...(1—hj_hj. 


Then ¢; > 0 on R", and ¢; € C&°(R") with spt 6; C spth; C Vu; for j €N. 
This proves parts i) and ii). 
An easy induction argument establishes 


k 


56; =1-(—hy)...(1— hy) 


j=l 


fork €N. Ifx € Q, then x € Wj, for some jo so 1 — hj, (x) = 0. Thus 


k 
> °4j@) =1-0=1 


j=l 


fork > jo. If H is a compact subset of Q, then H C W, U---U Wy for some 
NEN. If W = W, U---U Wy, then x € W implies hy (x) = 1 for some 
1 <k <N; that is, 6;(x) = 0 for all j > N. Hence, 


N lee) 
Y 2 6)(x) =) o)(%) =1 
j=l j=l 


for allx € W. |_| 
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A sequence of functions {¢;}jen is called a (C°) partition of unity on Q sub- 
ordinate to a covering {Vy}eea if and only if Q and all the V,’s are open and 
nonempty, the @;’s are all continuous with compact support and satisfy state- 
ments i) through iv) of Theorem 12.59. By a C? partition of unity on Q we shall 
mean a partition of unity on Q whose functions ¢; are also C? on Q. By The- 
orem 12.59, given any open covering V of any nonempty set 2 C R” and any 
extended real number p > 0, there exists a C”? partition of unity on Q subordi- 
nate to V. 

C? partitions of unity can be used to decompose a function f into a sum of 
functions f; which have small support and are as smooth as f. For example, let 
f be defined on a set 2, {@j}jen be aC? partition of unity on Q subordinate to 
a covering {V;}jen, and f; = f¢;. Then 


f®% = f®% Oj) => fWE;W =D) fj 
j=l 


j=l j=l 


for allx € Q. If f is continuous on Q and p > 0, then each f; is continuous on Q; 
if f is continuously differentiable on Q and p > 1, then each /f; is continuously 
differentiable on Q. Thus, f can be written as a sum of functions f; which are 
as smooth as f. This allows us to pass from local results to global ones; for 
example, if we know that a certain property holds on small open sets in Q, then 
we can show that a similar property holds on all of Q by using a partition of 
unity subordinate to a covering of Q which consists of small open sets. 

To illustrate the power of this point of view, we now show that the integral 
can be extended from Jordan regions to open bounded sets, even though such 
sets are not always Jordan regions. This extension is a multidimensional version 
of the improper integral. (The proofs Theorems 12.63 and 12.64 are based on 
Spivak [12].') 

STRATEGY: The idea behind this extension is fairly simple. Let V be a bounded 
open set and let f be locally integrable on V; that is, f : V > R is integrable 
on every closed Jordan region H Cc V. For eachx e€ V, choose an open Jordan 
region V(x) so small that x € V(x) Cc V. [e.g., V(x) could be an open ball.] 
Then {V(x)}xey is an open covering of V, and by Lindeldf’s Theorem it has a 
countable subcover, say V = {Vj}jen. Let {$j}jen be a partition of unity on 
V subordinate to V. Since f is locally integrable on V, each f¢; is integrable. 
Since f = ey fj), it seems reasonable to define 


[ roax=y | Ff (x); (x) dx. 
us qa 


Before we can proceed, we must answer two questions: Does this series con- 
verge? And if it does, will its value change when the partition of unity changes? 
The next two results answer these questions. 


IM. Spivak, Calculus on Manifolds, (New York: W. A. Benjamin, Inc., 1965). 
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Let V be a bounded open set in R" and let V = {V;}jen be a sequence of 
nonempty open Jordan regions in V which satisfies 


Co 
v= yj. 
j=l 


Suppose that f : V — Ris bounded on V and integrable on each V;. If {@j} jen 
is any partition of unity on V subordinate to the covering V, then 


a / $j (*) f(x) dx (45) 
jale 


converges absolutely. 


Proof. Let R be an n-dimensional rectangle containing V and M = 
supxey | f (x)|. Since ¢; is supported on V;, the function ¢; f is integrable on 
V;. Moreover, if E = uN V; we have 


if $j (%) f (0) dx 
Vj 


N 
j=l 


N 
< Bt |b) (x) f (x)| dx 
j=l 


N 
= | Sle Fool ax 
Bai 
N 
< u | -16)(x)| dx < M Vol(E) < MRI < oo. 
Eo 


Therefore, the series in (45) converges absolutely. | 


The value of the series in (45) depends neither on the partition of unity chosen 
nor the covering V. 


12.61 Lemma. 
Let V be a bounded, nonempty, open set in R". Suppose that V = {V;}jen and 
W = {Wy }een are sequences of nonempty open Jordan regions in R” such that 


Suppose further that f : V — Ris bounded and locally integrable on V. If 
{¢;} jen IS a partition of unity on V subordinate to V and {Wx}xen iS a partition 
of unity on V subordinate to W, then 
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Df emre@dx= Sf noose ax (46) 
jal’ “i kee k 


Proof. By Lemma 12.60, both sums in (46) converge absolutely. By Exer- 
cise 12.5.5, {$; Wx} j,ken 1S a partition of unity on V subordinate to the covering 
{Vj al Wihiken: Thus 


ps > | scone F00 dx 


j=1k=1 


n 


also converges absolutely. Fix j € N. Since spt@; is compact, choose N <€ N 
so large that (x) = 0 fork > N and x ¢€ spt¢;. Hence, 


N 
/ pj (x) f(x) dx = i $j (x) Ss We (X) f (x) dx 
ie Yi k=1 
= yay es) bj) Wx(x) f(x) dx 
= Be pj (x) We (xX) f(x) dx. 


Thus 


| —| dx. 
as pj (x) f(x) dx mp ee bj (x) WE(X) f (x) dx 


j=lk=l 


Reversing the roles of j and k, we also have 


ae W(x) f(x) dx = o> a ee (We x) f (x) dx. 


k=1 j=l 


Since these series are absolutely convergent, we may reverse the order of 
summation in the last double series. a 


Using Lemma 12.61, we define the integral of a locally integrable function f 
over a bounded open set V as follows. 
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12.62 Definition. 


Let V be a bounded, nonempty, open set in R” and let f : V > R be bounded 
and locally integrable on V. The integral of f on V is defined to be 


WiN= | i0oFe ax, 
jai’ Vi 


where {@;} jen iS any partition of unity on V subordinate to an open covering 
VY = {V;}jen such that each V; is anonempty Jordan region and 


Co 
v= yj. 
j=l 


The following result shows that this definition agrees with the old one when 
V is a Jordan region. Thus, we shall use the notation J vy f(s) dx for Ty (f). 


12.63 Theorem. /f E is a nonempty, open Jordan region in R" and f : E> R 
is integrable on E, then 


i. f(x) dx = Ig(f). 
E 


Proof. Let ¢ > 0. Since E is a Jordan region, choose a grid G = {Q),..., Qp} 
of some n-dimensional rectangle R > E such that 


yd, 1@el<e. (47) 


OpNaEZD 


Let 


HS) Oe 
Q 


eCE 


Clearly, H is compact and by (47), Vol(E\H) < « (see Exercise 12.1.6d). 

Set M = supyer | f(x)|. Let {Rj}jen be a sequence of rectangles such that 
Rj CEandE= Uja1 R’, and let {¢;}jen be a partition of unity on E subor- 
dinate to V = {Ro} jeN- Since H is compact, choose N, € N such that ¢;(x) = 0 
for j > N; andx € H. Then, for any N > Nj, we have 
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N 
[ F ax-Yf ; (x) f (x) dx 
E ja Ri 


N 
— f f (x) ax- >of bj (x) f (x) dx 
E jae 


N 
< | Fe) — Yee rooiax 


j=l 
N 
em f 1-Y ojo dx 
E ‘=I 
< MVOo\(E \ H) < Me. 
We conclude that /z(f) exists and equals fe pe f(x) dx. a 
We now prove a change-of-variables formula valid for all open bounded sets. 


12.64 Theorem. Suppose that V is a bounded, nonempty, open set in R", that ¢ : 
V — R’ is 1-1 and continuously differentiable on V, and that @(V) is bounded. 
If Ag 4 00n V, then 


/ f(u) au= [ f (G(x) |Ag(®)| dx, 
o(V) V 


for all bounded f : 6(V) — R, provided f is locally integrable on $(V). 


Proof. For each a € V, choose by Theorem 12.45 an open rectangle W(a) 
such that W(a) c V and 


i f(u) au= [ f (P(x) |Ag(X)| dx. (48) 
¢(W(a)) W(a) 


Set W = {W(a)}acv. Then W is an open covering of V. By Lindeldf’s Theo- 
rem, we may assume that W = {W,}jen. Let {¢;}jen be a partition of unity 
on V subordinate to W; that is, a sequence of C® functions such that 


e,) 
sptg; CW) CV, JEN, and J oj) =1 
j=l 


for allx € V. By Theorem 12.10, each @(W;) is a Jordan region. By The- 
orem 11.39, each ¢(W;) is open. And by Exercise 12.5.4, {; 0 $ '}ien is 


a partition of unity on ¢(V) subordinate to the open covering {@(W;)} jen. 
Hence, by Definition 12.62 and (48), 
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[ ite dar a ($; ob ')(u) fu) du 
o(V) j=l o(W;) 
= De bj (x) f (G(x) |Ag(X)| dx 
j=l’ Mi 


= i F (G(x) |Ag(s)| dx. a 


Finally, we are prepared to prove a change-of-variables formula for functions 
whose Jacobians are zero on a set of volume zero. 


12.65 Theorem. [CHANGE OF VARIABLES FOR MULTIPLE INTEGRALS]. 
Suppose that E is a Jordan region in R" and that ¢ : E — R" is I-I and 
continuous on E, and C! on E°. Suppose further that there exists a closed 
subset Z of E such that Vol(Z) = Vol(@(Z)) = 0. If @(0E) is of volume zero 
and if Ag(x) # 0 for allx € E° \ Z, then 


fa) dus / F(O(%)|Ag(x)| dx 
o(E) E 


provided f is integrable on $(E). 


Proof. By the proof of Lemma 12.9, d(@(£)) is a subset of @(£). Since this 
last set is of volume zero, it follows that @(£) is a Jordan region. 

Set V := E°\ Zand Eo := (E \ E°) UZ. Then V is a bounded open set, Eo 
is of volume zero, E =V UEp, and VN Ej = &. 

Since ¢ is 1-1 on E, @(E) = @(V) Ud(£p) is a disjoint decomposition of the 
Jordan region @(E). Moreover, by Lemma 11.39, the set @(V) is bounded and 
open. Since 


$(Eo) = $(E \ E°) UP(Z) C OE) U G(Z), 


a set of volume zero, @(Eo) is also a set of volume zero. We conclude by 
Theorems 12.23, 12.24, and 12.65 that 


fu) du = i. f(u) du 
o(E) o(V) 
= [ F(@(®)|Ag(~)| dx = a FOM)lAg@l dx. 


We close by noting that, as general as it is, even this result can be improved. If 
something called the Lebesgue integral is used instead of the Riemann integral, 
the condition that Ag 4 0 can be dropped altogether (see Spivak [12], p. 72). 


EXERCISES 


12.5.1. If f, g : R” — R, prove that spt(fg) C spt f M spt g. 
12.5.2. Prove that if f, g € CO°(R”), then so are fg and af for any scalar a. 
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*12.5.3. Prove that if f is analytic on R and f(xo) 4 0 for some xo € R, then 
f £C2(R). 

12.5.4. Suppose that V is a bounded, nonempty, open set in R” and that 
@ : V — R" is 1-1 and continuously differentiable on V with Ag 4 0 
on V. Let W = {W;}jen be an open covering of V and {;} jen be aC? 
partition of unity on V subordinate to W, where p > 1. Prove that 
{pj 0 @ |} jenisa C! partition of unity on ¢(V) subordinate to the open 
covering {@(W;)} jen. 

12.5.5. Let V be open in R” and V = {Vj}jen, W = {Wi}cen be coverings of V. 
If {6} jen is aC? partition of unity on V subordinate to V and {Wx}xen is 
aC? partition of unity on V subordinate to W, prove that {6 Wx} ),cen is 
aC? partition of unity on V subordinate to the covering {Vj N Wx}j,ken- 

12.5.6. Show that, given any compact Jordan region H C R’”, there is a se- 
quence of C® functions ¢; such that 


lim | $j; dV = Vol(H) 
R 


J>@w JRn 


*12.6 THE GAMMA FUNCTION AND VOLUME 


The last result of this section uses Dini’s Theorem from Section 9.8. 
In this section we introduce the gamma function and use it to find a formula for 
the volume of any n-dimensional ball and an asymptotic estimate of n!. 

Recall that if f : (0, co) > Ris locally integrable on (0, oo), then 


iso y 
7 f(t) dt = lim i f(t) dt. 
0 x>0+ Jy 

yoo 


In particular, it is easy to check that le e “ dt is finite for all a > 0. 


12.66 Definition. 


The gamma function is defined by 


CO 
T(x) =) tle dt, x € (0, 00), 
0 


when this (improper) integral converges. 


By definition, 
CO 
rd) Sih ear 1. 
0 


and 
CO [o,e) 2 
raya = f fe ar=2 | e du= Jn. 
0 0 


(We used the change of variables t = u? and Exercise 12.4.10.) It turns out that 
I(x) is defined for all x € (0, ov). 
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12.67 Theorem. For each x € (0, 00), P(x) exists and is finite, P(x +1) = xT (x), 
and Y'(n) = (n— 1)! forn EN. 


Proof. Write 


1 oo 
P(x) -|/ Bole de +/ tle! dt =: Kt+h. 
0 1 


By l’H6pital’s Rule, 


lim e/*1? =0 
too 
for all y € R. Hence, e~'t*—! < e~'/? for t large and it follows, from Theo- 
rem 5.43 (the Comparison Theorem), that /, is finite for all x € R. 
To show that J; is finite for x > 0, suppose first that x > 1. Then t*~! < 1 
for all t € [0, 1] and 


1 1 
1 

| pet abe] e'dt=1——<o. 
0 0 e 


Therefore, (x) is finite for all x > 1. Next, suppose that 0 < x < 1. Then 
x+1> 1,sol(x + 1) is finite. Integration by parts yields 


[o2) 1 tee! 46 1 oe) 1 
D(x) = pe! dt = [no + / te dt = T(x + VD. 
0 x x 


t=0 > 0 
Therefore, I(x) is finite when 0 < x < 1. 

This argument also verifies x(x) = T(x+1) for x € (0, oo). Since P'(1) = 1, 
it follows that [(n) = (n — 1)! foralln EN. a 


The gamma function can be used to evaluate certain integrals which cannot 


be evaluated by using elementary techniques of integration. 


12.68 Theorem. Jf x, y € (0, 00), then 


} FOr 
i) i. Cg ery age een 
0 Paty) 

and 

u/2 r r 
ii) - cos”*—! g sin?! — dy = EOE 

0 2P(x + y) 

In particular, 


ili) 


is i do Mk = 1)/2)P 0/2) 
0 P(k/2) 


holds for all integers k > 2. 
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Proof. To prove part i), make the change of variables v = u/(1 +u) and write 


1 oe) y-l x-l d 
/ ytd = vy! dv= / ( u ) (1 “ss 4) a 
0 0 l+u l+u (1 +u) 
~ 1 x+y 
=i, uw! (=) du. 
0 l+u 


It follows from two more changes of variables [s = t/(1 + u) and w = su] 
and Fubini’s Theorem that 


1 
Tat » | va —v)* | dv 
0 


love) ee) 1 x+y 
= / / i (—) ty—-le—t dt du 
0 Jo u 
CO CO 
= i uy! gtty-1e—su+)) ds du 
0 Jo 
CO CO 
=i glee (| uw! s%e—S4 au) ds 
0 0 
CO CO 
=) ols (| i teu aw) ds =T(x)T'(Qy). 
0 0 


To prove part ii) use the change of variables v = sin” y and part i) to verify 


m/2 I 1 r r 
i cos! y sin! y dy = =f pitas 
2 Jo 2r(x + y) 


Specializing to the case y = (k — 1)/2 and x = 1/2, we obtain part iii). a 


The connection between the gamma function and volume is contained in the 
following result. 


12.69 Theorem. [fr > 0 anda € R", then 
Ort ”!2 


Proof. By translation invariance (see Exercise 12.1.7) and Theorem 12.22, 
Vol(B;(a)) = f p | dx for B = B,(0). We suppose for simplicity that n > 2, and 
we introduce a change of variables in R” analogous to spherical coordinates. 
Namely, let 


X1 =PCOSY], xX2=PSING|COSP2, X3= P SING] SING? COS G3, ba. 
Xn—-| = PSING,...SiINGp_2cosO, and xX, = PSsing,...singy,_2 sind, 
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where 0 <p <r, 0< 6 < 2nm,and0 < 9; <7 for j =1,...,n —2. Aneasy 
induction argument shows that this change of variables has Jacobian 


n 3 


A := p""! sin"? yg, sin" ” @... sin’ Qn—3 SiN @p—2. (49) 


Hence, by Theorems 12.65 (or Theorem 12.46 and a limiting argument) and 
12.68i1i, 


Vol(B,(a)) = iu 1 dx 
B 


r 4 a 20 
=i / af i p"—| sin’? gy ...singy_2 dO dg... dYn_2 dp 
0 JO 0 Jo 
us 


2 n a 
eeu (| sin"? p a) vee (| sing a) 
n 0 0 


_ 2nr" T(n—1)/200/2) Tdn—2)/2QPa/2)  Tayra/2) 
on P(n/2) rGa= 17)? G7} 


Canceling all superfluous factors and substituting the value ./z for (1/2), we 
conclude that 


n n—2 n—n/2 
ewe a (" u)) _ orn 


T(n/2) )  n¥(n/2)' 


This formula agrees with what we already know. For n = | we have 


2rm/? 
Vol(B,(0)) = ———__. > 3, 
r(1/2) 
for n = 2 we have 
Vol(B,-(0, 0)) as 2 
(0) 5 = — ‘ 
: ard). 
and for n = 3 we have 
orsn3/2 Ord 3/2 


Vol(B;,(0, 0, 0)) = xr. 


33/2) (/2Fd/2) 3 


We close this section with an asymptotic estimate of n!. First, we obtain an 
integral representation for n!/(n"*+!/7e—"), 


12.70 Lemma. 
If o(x) =x —logx —1, x > 0, then 


n! eo = (l+t/./n) 
a= | aaa aca 
n?itl/2e-n ae 
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Proof. By Definition 12.66 and Theorem 12.67, we can write 


Co 


ni} =T(+1) a xe dx. 
0 


Making two changes of variables (first x = ny, then y = 1 + t/./n), we con- 
clude that 


n! 1 ak 6,2 ee ee 
ynti/ze—n val (=) e dx 
[o@) 
0 


CO CO 
a vi | e000) dy al eno tt/ Vm) gy - 
0 —J/n 


Next, we derive some inequalities which will be used, in conjunction 
with Dini’s Theorem, to evaluate the limit of the integral which appears in 
Lemma 12.70. 


12.71 Lemma. 
If o(x) =x —logx —1, x > 0, then 


(x — 1)¢'(x) —26(x) > 0, for0<x <1, 


and 
(x — 1)6'(x) —2¢(x) <0, forx > 1. 


Moreover, there is an absolute constant M > 0 such that 
(x) >M(x—-1)*,  for0<x <2, (50) 


and 


o(x)>Ma-1), forx >2. (51) 


Proof. Let w(x) = 2logx — x + 1/x and observe that (x — 1)@'(x) — 2@(x) = 
w(x). Since w/(x) = —(x — 1)?/x? < 0 for all x 4 1, Wis decreasing on 
(0, co). Since y(1) = 0, it follows that w > 0 on (0, 1) and yw < 0 on (1, ov). 
This proves the first pair of inequalities. 

To prove the second pair of inequalities, observe first that, by Taylor’s 
Formula, 


(x—1)?  @-1)? 
Dh. se 


o(x) = (1) + 9’) — 1) + ¢") 


for some c between x and 1. Thus $(x) > (x — 1)7/8 for all 0 < x < 2. Next, 
observe, since @(x) > 0 for x > 1 and #(x)/(x — 1) ~ lasx > ow, that 
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o(x)/(« — 1) has a positive minimum, say m, on [2, co). Thus (50) and © 
hold for M := min{m, 1/8}. 


Our final preliminary result evaluates the limit of the integral which appears 
in Lemma 12.70. 


12.72 Lemma. 
If o(x) = x —logx — 1 for x > 0, and F,(t) = e~"@"+t/V for n © N and 


> —./n, then 


[e,2) 


a 2 
lim F,(t) dt = / a hiege, 
Jn —0o 


noo J_ 


STRATEGY: The idea behind the proof is simple. By l’H6pital’s Rule, 


t td/(1+t i? t 1? 
yh) Cee ee ie ie ee 
n->0o Jn n>oo 2 1/.J/n 2 n>c 


so F(t) > ge as n — ov, for every t € R. Thus Sa F,,(t) dt should 


converge to ee eV /2dtasn > oo. To prove this, we break the integral over 
(—./n, 00) into three pieces: one over (—./n, —./a), one over (./a, 00), and one 
over (—./a, /a). Since et /2 is integrable on (—oo, oo), the first two integrals 
should be small for a sufficiently large. Once a is fixed, we shall use Dini’s 
Theorem on the third integral. Here are the details. 


Proof. Let « > 0 and observe that 


i F,,(t) ar [ et 2 at 
=i _ Jf 


<Khth+h+h 


We 
i (Faw) = eri) dt +/ et /2 at 
~Ja It\|>/a 


os) —Ja 
/ iFolar + | [F,(t)| dt 
us Lg 


for anya > Oandn EN, provided n > a. Hence, it suffices to prove that 
[I;| < ¢/4 for j = 1,2, 3,4, and n, a sufficiently large. 
Let M be the constant given in Lemma 12.71, and choose a > 0 so large 


that 
CO 
/ go gee } eM gt <= (52) 
Itl> Va 4 va 4 
and 
if el at <=, (53) 
Il>Va 4 
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By (53), |/2| < ¢/4. 
To estimate |/;| for j ¢ 2, fix t > —./a and consider the function G(x) = 
e*bU+t//*) x 5 0, By the Product Rule, 


Tee Ce eG eee ere 2 
aon (558 (1+) (1+) 


ox) 


=> ((y — 1)¢’(y) — 260)), 


where y = 1 + t/,/x. Thus by Lemma 12.71, G’(x) > 0 for x > a, —/a < 
t < 0, and G(x) < 0 forx > 0, t > O. It follows that for each 


t € (—./a, 0), Fr(t) eR asn > oo, and for each t € (0, 00), Fi(t) J eh /2 
as n —> oo. Hence, by Dini’s Theorem (Theorem 9.60), 


Ja Ja 
i F,(t) dt > / etl at 
_Ja _Ja 


asn — oo. Thus, we can choose an N € Nso large that n > N implies || < «¢. 
It remains to estimate |/;| for j = 3, 4. 
To this end, let n > max{N, a}. By (50) and (51), 


t 1? 
no{1+—)>nM—=Mr for—VJn <t < Vn, 
fh n 


and 


t t 
no (14) = aM = fort > Jn. 


Since n > a, it follows that 


60 ay 
| + [al = lFoidr + | F,(t)| dt 
Ji ~Vi 


a n 


< 


CO 
/ eo Me ar | e™ at 
Jasit|<.J/n Jn 


CO 
<| e-uP ar | eM dt. 
Itl>J/a Ja 


We conclude by (52) that |/3| + |J4| < ¢/2 as required. a 


12.73 Theorem. [STIRLING’S FORMULA]. 
Forn €N sufficiently large, n! ~ /2n (n"+1/2)e-": that is, 


: n! 
a J27 (n+1/2)e-n + 
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Proof. By Exercise 12.4.10 and the change of variables t = /2u, we have 


ee. 2 ~ 2 sg 2 
/ et! /? at = vif edu =2vi f ee” du=v2n. 
—0o 0 


—CoO 


Therefore, it follows from Lemmas 12.70 and 12.72 that 


n! 1 oe ‘ 
ia —no(1+t/ Vn) 
neo apart iter ninco 20 ee si 
: i » oP dy =I a 
=) —$—{ e => f 
J 210 —oo 


EXERCISES 
12.6.1. Show that 


00 1 
i Ior'e “ar =r (2). 
@ 4 
ie a ee 
og { ~ =2: 
0 Ss 


=oh k 


12.6.2. Show that 


12.6.3. For k € N, show that 


12.6.4. Show that the volume of a six-dimensional ball of radius r is r°2?/3, 
and the volume of a nine-dimensional ball of radius r is 16r?*4/945. 

12.6.5. Verify (49). 

12.6.6. Suppose that n > 2 and define an n-dimensional ellipsoid by 


x7 a oe 
E = (x1, ,Xn) 2 + 4 + t 2 < 1 
ay ay a; 
Prove that 
907.5. age"!* 
Vol(E) = eGlsiutnis = 
nT (n/2) 


12.6.7. Suppose that n > 2 and define an n-dimensional cone by 


‘Oa G 9 amen: 7p ee (h/r)./x5 +++) +x2 <x, <h}. 


Prove that 
Ohh lq @-D/2 


VOC) =a DGD: 
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12.6.8. Find the value of 
/ UT anh) 
B, (0) 
for each k EN. 


12.6.9. If f : B,(0) > R is differentiable with f(0) = 0 and ||V f(x)|| < 1 for 
x € B,(0), prove that the following exists and equals 0. 


lim f(x) | dx 
k>0oo Bi (0) 


12.6.10. a) Prove that I is differentiable on (0, 00) with 
Ce 
r(x) = / é- te Mosradi. 
0 


*b) Prove that I is C© and convex on (0, 00). 


CHAPTER = 13 


Fundamental Theorems of Vector 
Calculus 


This chapter is more descriptive and less rigorous than its predecessors. Our 
goal is to lay a practical foundation for calculus on manifolds. 


13.1 CURVES 


For a non-mathematician, a curve is a smooth line which bends, without cor- 
ners; a one-dimensional object with length but no breadth. Of course, this def- 
inition is too imprecise. It is also too restrictive. Our concept of a curve will 
include not only “smooth” objects such as the graphs of the function y = x? 
and the relation x* + y* = 1, but also objects with “corners” such as the graph 
of y = |x|. 

Recall that if 7 C Rand ¢@: J > R”, then the image of J under ¢ is the set 


o(1) = {x € RR” :x = ¢(t) for some t € J}. 


Also recall that, given a, b € R” with b # 0, the image of R under @(t) := a+tb 
is the straight line through a in the direction of b. This is the simplest type of 
curve in R”. 

A naive attempt to define a general curve in R” is to insist that it simply be the 
image of an interval under some continuous function @ : R > R”. It turns out 
that this definition is too broad. There are continuous functions (called “space- 
filling curves”) which take the unit interval [0, 1] onto the unit square [0, 1] x 
[0, 1] (see Boas [2]). One way to fix this definition is to use homeomorphisms 
(i.e., continuous functions whose inverses are also continuous). Since we are 
interested primarily in the differential structure of curves, we take a different 
approach, using differentiable functions to define curves (see Definition 13.1 
below). 

We begin by extending the definition of partial differentiation to include func- 
tions defined on nonopen domains. Let m,n, p € N, and E be anonempty subset 
of R”. A function f : E > R” is said to be C? (on £) if and only if there is an 
open set V > E and a function g : V — R” whose partial derivatives of orders 
J < p exist and are continuous on V such that f(x) = g(x) for all x € E. In this 
case we define the partial derivatives of f to be equal to the partial derivatives 
of g; for example, df; /dx;(x) = 0g;/0x,(x) fork = 1,2,...,n, j =1,2,...,m, 
and x € E. A function f : E > R” is said to be C® (on E£) if and only if f is C? 
on E for all p € N. Notice that this agrees with Definition 4.6 when n = 1. Also 
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notice that the Mean Value Theorem and the Inverse Function Theorem hold 
for functions in C!(E). 
Henceforth, p will denote an element of N or the extended real number oo. 


13.1 Definition. 


A subset C of R” is called a C? curve (in R”) if and only if there is a nonde- 
generate interval J (bounded or unbounded) and a C? function @ : J] > R” 
such that @ is 1-1 on J° and C = ¢(J). In this case, the pair (@, /) is called a 
parametrization of C, and C is called the trace of (@, 1). The equations 


xj = P(t), teJ, j=l,...,m, 


are called the parametric equations of C induced by the parametrization (@, J). 


Thus the straight line through a in the direction of b is a C® curve with 
parametrization @(t) :=a+¢tb, J/=R. 
For most applications, we must assume more about curves. 


13.2 Definition. 


AC? curve is called an arc if and only if it has a parametrization (¢, 7) where 
I = [a,b] for some a,b € R. In this case, we shall call (a) and $(b) the 
endpoints of C. An arc is said to be closed if and only if ¢(a) = ¢(b). 


Thus L(a; b), the line segment from a to b, is an arc. The circle x+y =a 


is an example of a closed arc (see Example 13.4 below). 

A closed arc is said to be simple if and only if it does not intersect itself 
except possibly at its endpoints. Simple closed arcs are also called Jordan curves 
because of the Jordan Curve Theorem. This theorem states that every sim- 
ple closed arc C in R? separates R? into two pieces, a bounded connected set 
E and an unbounded connected set Q, where DE = 0Q = C. However, as 
W. F. Osgood! discovered, the set E is not necessarily a Jordan region. 

Before we start developing a theory of curves, we look at several additional 
examples to see how broad Definitions 13.1 and 13.2 really are. First, we show 
that curves, as defined in Definition 13.1, include graphs of C? real functions. 


13.3 EXAMPLE. 


Let J be an interval and let f : 1 ~ R be aC? function. Prove that the graph of 
y = f(x) on/ isaC? curve in R’. 


I“ Jordan Curve of Positive Area,” Transactions of the American Mathematical Society, 
vol. 4 (1903), pp. 107-112. 
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Proof. Let @(t) = (t, f(t). Then ¢ is C? and 1-1 on J, and @(/) is the graph 
of y = f(x) as x varies over J. [We shall call (@, 1) the trivial parametrization 
of y = f(x).] a 


By an explicit curve we mean a curve of the form @(/), where either @(t) = 
(t, f(t)) or b(t) = (f(t), t) for some C? function f : J > R. Notice, then, that 
an explicit curve is a set of points (x, y) which satisfy y = f(x) [respectively, 
x = f(y)] for some real C? function f. 

We have just proved that every explicit curve is a curve in R*. The following 
result shows that the converse of this statement is false. 


13.4 EXAMPLE. 


Prove that the circle x? + y* = a? isa C® Jordan curve in R?. 


Proof. This circle can be described in polar coordinates by r = a (i.e., in 
rectangular coordinates by x = acos6, y = asin@). Set $(t) = (acost, a sint) 
and J = [0,27]. Then ¢ is C®, 1-1 on [0, 277), and @(J/) is the set of points 
(x, y) € R* such that x? + y* = a*. [The trace of this parametrization is 
sketched in Figure 13.1. The arrow shows the direction the point @(t) moves 
as t gets larger. For example, (0) = (a, 0) and $(z/2) = (0, a).] i 


#Y 


FIGURE 13.1 


Recall that the graph of a C” function on an interval is “smooth” (i.e., has a 
tangent line at each of its points). The following example shows that this is not 
the case for a general C? curve. 


13.5 EXAMPLE. 


Let o(t) = (cost, sin’ t) and J = [0,27]. Show that (@, J) is a parametriza- 
tion of a C® Jordan curve in R? which has “corners.” (This curve is called an 
astroid.) 


540 Chapter 13 Fundamental Theorems of Vector Calculus 


Proof. Clearly, ¢ is C® on J and 1-1 on [0, 277). Let x = cos’ t and y = sin’ t 
and observe by a double-angle formula that 


3 1 

2 2 2 
=- 2t -, 
x" +y fo eG 


Hence, \/x?2 + y? varies from a maximum of 1 (attained when t = 0, 7/2, 
x, 3x /2, 27) toa minimum of 1/2 (attained when t = 7/4, 37/4, 57/4, 77/4). 
Since J is connected and ¢ is differentiable, hence continuous, the set @(/) 
must also be connected. Plotting a few points, we see that @(/) is a four- 
cornered star, starting at (1, 0) and moving in a counterclockwise direction 
from 0B, (0, 0) to 0B, /2(0, 0) and back again (see Figure 13.2). As ¢ runs from 
0 to 27, this curve makes one complete circuit. a 


FIGURE 13.2 


We have enough examples to begin to explore the theory of curves. First, we 


define the “length” of a curve. (For a geometric justification of this definition, 
see Theorem 13.17 below.) 


13.6 Definition. 


Let C be aC? arc and (@, /) be one of its parametrizations. The arc length of 
C, as measured by (@, /), is defined to be 


HO / lel at. 


For example, let (¢, J) be the parametrization of the circle of radius a given 
by Example 13.4. Since ||@’()|| = a for all t € [0, 277], it is easy to check that 
L(C) = 27a, exactly what it should be. This also demonstrates why we insisted 
that parametrizations be 1-1 on the interior of their domains. If ¢ were not 1-1 
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on (0, 277), some part of the circle might be traced more than once, giving an 
inflated value of its arc length. 

Since ¢ is C? on a closed, bounded interval /, then ||@’|| is integrable on J by 
Theorem 12.21. Hence, L(C) is finite for any parametrization of aC? arc C. This 
is not necessarily the case if C is merely the continuous image of an interval (the 
space-filling curve is continuous but its length is infinite) or if C is the image of 
an open interval (see Exercise 13.1.4). 

When C is an explicit curve, say y = f(x) on [a, b], and (@, /) is the trivial 
parametrization, Definition 13.6 becomes 


b 
Lc) = | y lt (f(x)? dx. 


This agrees with the formula for arc length introduced in elementary calcu- 
lus texts. 

Before we continue, it is important to realize that every curve has many differ- 
ent parametrizations. For example, the line segment {(x, y) € R? : y =x, 0 < 
x < 1} is the trace of @(t) = (t,t) on (0, 1], of W(t) = (t/2, £/2) on (0, 2], 
and of o(t) = (1/t, 1/t) on [1, co). Although these functions trace the same 
line segment, each of them traces it differently. The function w traces the line 
“twice as slowly” as @, and o traces the line “backward” from @. Therefore, a 
parametrization (¢, 7) of C is a specific way of tracing out the points in C. 

At this point, it is natural to ask, Does the value of arc length, L(C), remain 
the same if we use different parametrizations of C? To answer this question, we 
begin by showing that any two parametrizations of the same arc are related by 
a one-dimensional change of variables tT. 


13.7 Remark. Let J, J be closed bounded intervals and let @ : I > R” be 1-1 
and continuous. Then (1) = w(J) for some continuous y : J > R"” if and only 
if there is a continuous function t from J onto I such that fb = $ oT. 


Proof. Since I is closed and bounded and ¢ is 1-1 and continuous on J, $7! is 
continuous from ¢(/) onto J (see Theorem 9.58 or 10.64). Since w(J) = @(/), 
it follows that t := $7! o w is continuous from J onto /. 

Conversely, if t is any continuous function from J onto J, then wy = @ oT is 
continuous from J onto $(J/); that is, W¥(J) = (J). i 


Thus if (@, 7) and (w, J) are C? parametrizations of the same arc and ¢ is 
1-1, then there is a continuous function t : J — I, called the transition from J 
to J, such that f = @ oT, or, equivalently, t = @! 0 w. It follows that if the 
transition is differentiable, then by the Chain Rule, 


Wu=¢(ctl))t'u), ued. (1) 


We are prepared to prove that the definition of arc length does not depend on 
the parametrization chosen provided the transition has a nonzero derivative. 
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13.8 Remark. /f (@, 1) and (Ww, J) are C? parametrizations of the same arc, if 
w = 60, where t takes J onto I and satisfies t'(u) # 0 for allu € J, then 


fivo a= [ IIp'(u)|| du. 
I J 


Proof. By hypothesis, t(J) = I. Hence, it follows from (1) and the Change- 
of- Variables Formula (Theorem 12.46) that 


[igo ar= | lool a= f Ip’ (c(w)) || |e"(u)| du =f Iv'(u)|| du. 
I t(J) J J 


We note that the condition t’ 4 0 can be relaxed at finitely many points in J 
(see Exercise 13.1.8). 

One productive way to think about different parametrizations of a curve C is 
to interpret @(t) as the position of a particle moving along C at time r. Different 
parametrizations of C represent different flight plans, some faster, some slower, 
some forward, and some backward, but all tracing out the same set of points. 


13.9 Remark. Let (¢, 1) be a parametrization of a C? curve, and let xo = (to) 
for some to € I°. If b(t) represents the position of a moving particle at time t, then 
||’ (to) || is the speed of that particle at position xy and, when (to) 4 0, (to) is 
a vector which points in the direction of flight at xo. 


Proof. Let to € I° and notice that, for each sufficiently small h > 0, the 
quotient 


b(t +h) — P(t) 
h 
is a vector which points in the direction of flight along the curve C (see 


Figure 13.3). To calculate the speed of the particle, define the natural 
parameter of the curve C := $(/) by 


t 
s=eai= [ Ip'(u)|| du, tt € [a, db). (2) 


By the Fundamental Theorem of Calculus, ds/dt = ¢'(t) = ||@'(t)||. Thus, the 
change of arc length s with respect to time at fo (i.e., the speed of the particle 
at xq) is precisely ||@’(to)||. | 


(ty +h)- oo) 
T 07 oe 
C h 


O(tg+h) 


Oo) o(1) 


FIGURE 13.3 
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By elementary calculus, every explicit C? curve is “smooth” (i.e., has a tangent 
line at each of its points). The astroid (Example 13.5) shows that a general C? 
curve does not have to be smooth at every point. 

Is there an easy way to recognize when a general C? curve has a tangent line 
(in the sense of Definition 11.21) at a given point on its trace? To answer this 
question, let (@, /) be the parametrization of the astroid given in Example 13.5, 
and notice that ¢’(t) = 0 when t = 0, 2/2, 2, 3/2, 27; that is, exactly at the 
points where the astroid @(/) fails to have a tangent line. (Notice that if we 
use the flight plan analogy, this condition makes much sense. It is impossible to 
draw a curve at a corner without pausing to make the direction change, that is, 
without making the velocity of the drawing device zero.) 

Could the answer to our question be this simple? Does a curve with para- 
metrization (g, /) have a tangent line at each point where ¢/ 4 0? 


13.10 Remark. Jf (¢, J) is a parametrization of a C? curve C in R?, and 
¢' (to) 0 for some to € I°, then C has a tangent line at (xo, yo) := (to). 


STRATEGY: By elementary calculus, the graph of every differentiable function 
has a tangent line at each of its points. The curve C is given by x = @1(t), y = 
¢2(t). If we could solve the first equation for rf, then by the second equation C is 
an explicit curve: y = ¢2 0 (x). Thus we must decide: Is ¢, ' differentiable? 
This looks like a job for the Implicit Function Theorem. 


Proof. Let (¢1, ¢2) represent the components of @. Since $’(t9) 4 0, we may 
suppose that ¢) (to) 4 0. Set F(x, t) = #1 (t) — x and observe by the Implicit 
Function Theorem that there is an open interval Jo containing xo and a con- 
tinuously differentiable function g : Jo — I such that ¢)(g(x)) = x for all 
x € Jo and g(xo) = fo. Thus the graph of y = f(x) := ¢2 0 g(x), x € Jo, 
coincides with the trace of @ on g(Jo); that is, near (x9, yo). It follows that C 
has a tangent line at (xo, yo). a 


Accordingly, we make the following definition. 


13.11 Definition. 
Let (@, /) be a parametrization of aC? curve C. 


i) (g, J) is said to be smooth at to € I if and only if $’(to) 4 0. 
ii) (¢, 1) is called smooth if and only if it is smooth at each point of J, in which 
case we call ¢’ the tangent vector of C induced by (@, /). 
iti) A curve is called smooth if and only if it has a smooth parametrization, 
unless it is a closed arc, in which case we also insist that one of its smooth 
parametrizations (w, [c, d]) satisfies W’(c) = w'(d). 


By definition, then, if a curve C has a smooth parametrization, then C is 
smooth. The converse of this statement is false, even for arcs. 
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13.12 Remark. Every smooth arc has nonsmooth parametrizations. 


Proof. Let (@, [a, b]) be a smooth parametrization of a smooth arc C. We 
may suppose (by a preliminary change of variables) that 0 € (a,b). Then 
w(t) := o(0), J = (a, vb) is a parametrization of C. It is NOT smooth, 
however, since w’(t) = @'(t*) - 3t7 = 0 whent = 0. a 


This raises another question: When does a change in parametrization pre- 
serve smoothness? To answer this question, suppose that (@, J) and (W, J) are 
parametrizations of the same curve, with @ 1-1 and (@, J) smooth. If the transi- 
tion t, from J to /, is differentiable, then, by (1), (w, J) is smooth if and only if 
t'(u) # 0 for all u € J. This leads us to the following definition. 


13.13 Definition. 


Two C? parametrizations (¢, 1), (Ww, J) are said to be smoothly equivalent if 
and only if they are smooth parametrizations of the same curve, and there is a 
C? function t, called the transition from J to J, such that ¥ = @ot, t(J) = 7, 
and t’'(u) 4 Ofor allu € J. 


Thus, by Remark 13.8, the arc length of a curve is the same under smoothly 
equivalent parametrizations. 

Notice that since t’ is continuous and nonzero, either t’ is positive on J or t’ 
is negative on J. Hence, by Theorem 4.17i, a transition t between two smoothly 
equivalent parametrizations is always 1-1. 

The following integral can be interpreted as the mass of a wire on a curve with 
density g (see Appendix E). 


13.14 Definition. 


Let C be a smooth arc in R” with parametrization (¢, /), and let g : C > R be 
continuous. Then the /ine integral of g on C is 


[eas= [ eooido dt. (3) 


For an explicit curve C given by y = f(x), x € [a, b], this integral becomes 


b 
[sas= | HG. f@).A1 LIP @P ax. 


We note that by Definition 13.6, the line integral (3) equals the arc length of C 
when g = 1. This explains the notation ds. Indeed, the parameter s represents 
arc length [see (2) above], so, by the Fundamental Theorem of Calculus, ds/dt = 
|¢’(t) ||. Hence, the Leibnizian differential of s satisfies ds = ||@’(t)|| dt. We also 
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note that the line integral of a function g on a curve is the same under smoothly 
equivalent parametrizations (see Exercise 13.1.8). 

Since a line integral is a one-dimensional integral, it can often be evaluated 
by the techniques discussed in Chapter 5. 


13.15 EXAMPLE. 


Find /. g ds, where g(x,y) = 2x + y, C = $(/), O(t) = (cost, sint), and 
I =(0,2/2I. 


Solution. Since ||@'(t)|| = ||(— sint, cost)|| = 1, we have 


m/2 
[sa=f (2cost + sint) dt = 3. |_| 
Cc 0 


For even the simplest applications, we must have a theory rich enough to 
handle curves, like the boundary of the unit square 0([0, 1] x [0, 1]), which are 
not smooth but a union of smooth pieces. Consequently, we extend the theory 
developed above to finite unions of smooth curves as follows. 

A subset C of R” is called a piecewise smooth curve (respectively, a piecewise 
smooth arc) if and only if C = uy Cj, where each C; is a smooth curve (respec- 
tively, arc) and for each j # k either C; and C, are disjoint or they intersect at 
a single point. Thus a piecewise smooth curve might consist of several disjoint 
smooth pieces, like the boundary of an annulus 0 < a* < x? + y* < b’, or sev- 
eral connected pieces with corners, like the boundary of the perforated square 
([0, 3] x [0, 3])\(1, 2] x [1, 2]). 

Let C = Uj_,Cj be a piecewise smooth curve. By a parametrization of C we 
mean a collection of smooth parametrizations (@;, 1;) of Cj. Two parametriza- 
tions Ui (bj; Tj) and Ui ;, Jj) of C are said to be smoothly equiva- 
lent if and only if (@ fe dp) and (w jo Jj) are smoothly equivalent for each 
j € {i,..., N}. Finally, if C is a piecewise smooth arc, then the arc length of 
C is defined by 


N 
LOS) HG), 


j=l 


and the /ine integral on C of a continuous function g : C — R is defined by 


N 
[ew=d gds. 
G jal Cj 


13.16 EXAMPLE. 


Parametrize the boundary C of the unit square [0,1] x [0,1] and compute 
te g ds, where g(x, y) =x*+y°. 
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Solution. C has four smooth pieces which can be parametrized by 
@O=6,0), @O=A.9), BO=6)D, 4 = (0,4), 


for t € [0, 1]. Since Id; (t)|| = 1, we have, by definition, 


1 1 1 1 19 
[ea f Part fatreyat f @+vart | rdt=-—. 
C 0 0 0 0 6 


We close this section with a geometric justification of Definition 13.6 which will 
not be used elsewhere. 


(4) 


(to) 


FIGURE 13.4 


The arc length of some non-C? curves can be defined by using line seg- 
ments for approximation (see Figure 13.4). Namely, we say that a curve C with 
parametrization (@, /) is rectifiable if and only if 


k 
|C ||:= sup ME l@(t;) — Ot; Il : {t0, 1, -.., %} 18 a partition of J 
j=l 


is finite, in which case we call ||C|| the arc length of C. 
The following result shows that every C? arc is rectifiable, and the two defini- 
tions we have given for arc length agree. 


*13.17 Theorem. Jf C is aC? arc, then ||C|| is finite, and L(C) = ||C|l. 


STRATEGY: The idea behind the proof is simple. By the Mean Value Theorem, 
each term ||@(¢;) — @(t;-1)|| which appears in the definition of ||C|| is approxi- 
mately ||@'(¢;)||(t; — t;-1), a term of a Riemann sum of the integral of ||’ (t)]|. 
Thus, we begin by controlling the size of ||'(¢;)|I. 


Proof. Let ¢ > 0, write @ = (¢1, ¢2,..., @m), and set 


oe 1/2 
Fy si.ig hn) = (SWicoF] 


l=1 
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for (%1,...,%m) inthe cube J” := I x---xJ. By hypothesis, F is continuous on 
I”, and J” is evidently closed and bounded. Thus, F is uniformly continuous 
on J”; that is, there is a 6 > 0 such that 


: é 
x,yel™” and ||x—y\| <6 imply Bie laee Uiaesey p 


Let P = {uo,...,un} be any partition of 7. By Theorem 5.18, choose a 
partition Po = {to, t),..., %} of J finer than P such that ||Po|| < 5/./m and 


k 
[ison dt =; < SU llg' ety — 7-1) < : IIb’ (t)|| dt + - 
j=l 


Fix @ € {1,..., m}and j € {1,..., k}. By Theorem 4.15ii (the one-dimensional 
Mean Value Theorem), choose a point c;(¢) € [t;—1, t;] such that 


be(tj) — be(tj-1) = O(c; () (tj — tj-1). 


Since ||Pol| < 6//m, we have |F(t;,..., tj) — F(cj(1),-...¢j(m))| < e/(QIII). 
Since @'(t) = (6) (t), .... ¢},()), we also have F(t;,...,¢;) = ||o’(tj)|| and 


m 1/2 
F(cj(),...,¢e;@m)) (tj — tj-1) = (SWicenr) (t; — tj-1) 


l=1 


= ||O(t;) — o(tj-1) II. 
It follows that 
k e k k ‘ 
DIP EDIG —45- — 5 < Do Ie) — 6Gj-DI< Dd Dj — ja) +5. 
j=l j=l j=l 


Combining this double inequality with the preceding one, we obtain 


k 
/ Ie’ @ll dt —e < So lb) — O@j-DIl < / Ip") || dt +e. 


j=l 


Using the left-hand inequality and the definition of ||C ||, we have 


k 
L(C)-e= / IPI dt —€ < D6) — O(t;-DIl < ICI 


j=l 


It follows from Definition 13.6 that L(C) < ||C|]. On the other hand, since 
Po = {to,t,..-, ¢} is finer than P, it follows from the right-hand inequality 
that 
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N k 
Yo lec) — bull < Yo Ib) — ODI < fio dt +e. 


i=l j=l 


Taking the supremum over all partitions {uo,..., uv} of 7, we have 


Icll< 7 lel dt +e; 


that is, ||C|| < L(C). | 
EXERCISES 
13.1.1. Let w(t) = (asint,acost), o(t) = (acos2t,asin2t), J = [0, 27), and 


13.1.2. 


13.1.3. 


*13.1.4. 


13.1.5. 


13.1.6. 


J = [0,7). Sketch the traces of (W, J) and (o, J). Note the “direc- 
tion of flight” and the “speed” of each parametrization. Compare these 
parametrizations with the one given in Example 13.4. 

Let a,b € R”, b £ 0, and set g(t) = a+ tb. Show that C = ¢(R) is 
a smooth unbounded curve which contains a and a-+ b. Prove that the 
angle between $(t,) — @(0) and @(f2) — @(0) for any ft, t2 A OisOorz. 
Let J be an interval and f : J > R be continuously differentiable with 


IFO? +1f' OP? 40 


for all @ € J. Prove that the graph of r = f(@) (in polar coordinates) is 
a smooth C! curve in R’. 

Show that the curve y = sin(1/x), 0 < x < 1, is not rectifiable. Thus 
show that Theorem 13.17 can be false if C is not an arc. 

Sketch the trace and compute the arc length of each of the following. 


a) b(t) = (e~ cost, e” sint, e”), t € [0, ] 
b) 4y? = 9x? from (—2/3, 1) to (2/3, 1) 
c) o(t) = (ie. z 101 pl0l) ¢ € [0, 1] 

d) The asteroid of Example 13.5 


For each of the following, find a (piecewise) smooth parametrization of 
C and compute f.. g ds. 


a) C is the curve y = /25 — x2, x > 0, and g(x, y) = xy’. 

b) C is the portion of the ellipse x7/a* + y?/b? = 1, a, b > 0, which lies 
in the fourth quadrant and g(x, y) = xy. 

c) C is the intersection of the surfaces x* + y? = 4 and z = 2x?, and 
g(x, y,z) = V1 + yz. 

d) C is the triangle with vertices (0,0,0), (1,1, 0), and (0,0, 2), and 
B(x, y,2) = 2x+yt+z. 


13.1.7. 


13.1.8. 


13.1.9. 


13.1.10. 


13.1.11. 
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Let C be a smooth arc and g; : C > R be continuous for n € N. 


a) If gx — g uniformly on C, prove that es g.ds > le gds ask > o. 
*b) Suppose that {g;,} is pointwise monotone and that g, — g pointwise 
on C ask > ov. If g is continuous on ¢(/), prove that ee gpds > 
Sogds ask > o. 
Suppose that (@, /) is a parametrization of a smooth arc in R”, and that 
t: J > RisaC! function, 1-1 from J onto J. If t/(u) 4 0 for all 
but finitely many vu € J, Ww = @ot, and g: dU) > Ris continuous, 
prove that 


[some dt = [ sceomiv' wo du. 


[THE FoLtiuM or Descartes]. Let C be the piecewise smooth curve 
o( U fb), where I; = (—oo, —1), bb = (—1, oo), and 


Gi 3t 312 
— eae ae ey” 


Show that if (x, y) = @(¢), then x? + y? = 3xy. Sketch C. 


The absolute curvature of a smooth curve with parametrization (y, /) 
at a point x9 = W(fo) is the number 
O(t 
K(Xo) = lim ote) 
t>1 L(t) 


when this limit exists, where (rt) is the angle between W’(t) and w’ (to), 
and ¢(t) is the arc length of w(/) from w(t) to W(to). [Thus « measures 
how rapidly 6(t) changes with respect to arc length. ] 


a) Given a,b € R", b # 0, prove that the absolute curvature of the 
line A from a to b is zero at each point xp on A. 

b) Prove that the absolute curvature of the circle of radius r is 1/r at 
each point xg on C. 


Let C be a smooth C? arc with parametrization (@, [a, b]), and suppose 
that s = £(t) is given by (2). The natural parametrization of C is the pair 
(v, [0, L]), where 


v(s)=(@ol')\(s) and L=L(C). 


a) Prove that ||v’(s)|| = 1 for all s € [0, ZL] and the arc length of a 
subcurve (v, [c, d]) of C is d — c. (This is why (v, [0, L]) is called the 
natural parametrization.) 

b) Show that v’(s) and v”(s) are orthogonal for each s ¢€ [0, L]. 

c) Prove that the absolute curvature (see Exercise 13.1.10) of (v, [0, L]) 
at Xo = v(s) is K (Xo) = ||" (so) ||. 
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d) Show that if xo = $(to) = v(so) and m = 3, then 


IIo) x 6" (to) I 
II’ (to) II? 


e) Prove that the absolute curvature of an explicit C? curve y = f(x) 
at (xo, yo) under the trivial parametrization is 


(Xo) = ||v’(so) x v(so) || = 


___ly"@o)! 
(1+ (@0))2)3?' 


13.2 ORIENTED CURVES 


Every parametrization (@, /) of a smooth curve C determines a “direction of 
flight” along C; that is, determines the direction @(f) moves as ¢ increases on /, 
equivalently, the direction in which the tangent vector ¢/(t) points. This direc- 
tion is called the orientation of C induced by (@, 1). (The arrows in Figures 13.1 
and 13.2 above represent the orientation of the given parametrization.) 

If C is smooth and (@, J) is one of its smooth parametrizations, then the unit 
tangent vector of C at xp = (fo) is defined by 


' (to) 
lp’ (to) | 


Suppose that (@, 7) and (%, J) are smoothly equivalent parametrizations of 
the same curve with transition t. Since t’ is continuous and nonzero, either 
t/(u) > Oforallu € J or t’(u) < 0 for allu € J. In the first case, the vectors 
@'(t(u)) and w’(u) point in the same direction [see (1) in Section 13.1]; hence, 
these parametrizations determine the same orientation and same unit tangent. 
In the second case, the vectors $’(t(u)) and w’(u) point in opposite directions 
and, hence, determine different orientations and opposite unit tangents. Ac- 
cordingly, we make the following definition. 


T(xo):= 


13.18 Definition. 


Two parametrizations (@, /) and (w, J) are said to be orientation equivalent 
if and only if they are smoothly equivalent and the transition t from J to J 
satisfies t’(u) > O for allu € J. 


In practice, a curve and its orientation are often described geometrically. The 
reader must provide a parametrization which traces the curve in the prescribed 
orientation. Here are two typical examples. 


13.19 EXAMPLE. 


Find a smooth parametrization of the curve C in R®, oriented in the clockwise 
direction when viewed from high up on the positive z-axis, formed by intersect- 
ing the surfaces x? + 5y? = 5 and z = x’. 
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FIGURE 13.5 


Solution. The elliptical cylinder x? + Sy? = 5 intersects the parabolic cylinder 
z = x* to form a “sagging ellipse” (the shaded region in Figure 13.5 repre- 
sents that part of z = x? which lies inside the cylinder x? + Sy? = 5). Using 
x = J5sint, y = cost to incorporate clockwise motion around the ellipse 
x? + 5y? = 5, we see that z = x? = Ssin*t. Thus a smooth parametrization of 
C with clockwise orientation is @(t) = (/5sint, cost, 5sin?t) on J = [0,27]. 


13.20 EXAMPLE. 


Find a smooth parametrization of the curve C in R®, oriented from right to left 
when viewed from far out the line y = x in the xy-plane, formed by intersecting 
the surfaces z = x? — y* andx+y=1. 


Solution. The saddle surface z = x* — y* intersects the plane x + y = 1 to form 
a curve which cuts across the surface. Using x = t asa ee to incorporate 
right to left orientation, we see that y = 1—t andz=?t-—(1- t)? = 2t—1. Thus 
a smooth parametrization of C is @(t) = (t, 1—t, 2t—1) on J = R. In particular, 
C is a line in the direction (1, —1, 2) passing through the point (0, 1, —1). a 


The following integral arises naturally in the study of fluids, electricity, and 
magnetism (e.g., see the discussion that follows this definition). 


13.21 Definition. 


Let C be a smooth arc in R” with unit tangent T, and let (¢, J) be a smooth 
parametrization of C. If F : C — R” is continuous, then the oriented line 
integral of F along C is 


[P vas= [Fab [ FOO) -9'0 dt. (4) 
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The notation F - d@ is self-explanatory. The notation F-T ds is consistent with 
equation (3) in Section 13.1. Indeed, T = $/(t)/||¢’(a)|| and ds = ||¢’(t)|| dt, so 
in the expression F - T ds, the scalars ||@‘(t)|| cancel each other out. 

What does this number represent? If F represents the flow of a fluid, then F-T 
is the tangential component of F; that is, a measure of fluid flow in the direction 
to which the tangent T points (see Appendix E). For example, suppose that C is 
the unit circle oriented in the counterclockwise direction and F(x, y) = (—y, x). 
The unit tangent to C at a point (x, y) is (—y, x), so F points in the same direction 
that T does. Hence, F - T = 1 is an indication that the fluid is flowing “with 
the tangent” rather than against it. On the other hand, if G(x, y) = (y, —x) and 
H(x, y) = (x, y), then G-T = —1 because the fluid is flowing against the tangent, 
and H - T = 0 because the fluid is flowing orthogonally to T (e.g., neither with 
nor against it). Therefore, the integral of F - Tds over C is a measure of the 
circulation of F around C in the direction of the tangent vector. If this integral 
is positive, it means that the net flow of the fluid is with T rather than against T. 

Since an oriented line integral is a one-dimensional integral, it can of- 
ten be evaluated by techniques introduced in Chapter 5. Here is a typical 
example. 


13.22 EXAMPLE. 


Describe the trace of @(t) = (cost, sint, tf), tf € J = [0, 477], and compute 


[Fras 
C 


where F(x, y, z) = (1, cosz, xy) and C = @(/). 


Solution. Let (x, y,z) = @(t). Since x* + y? = 1, the trace of ¢ lies on the 
cylinder go y? = 1, 0<z < 4a. Ast increases, the point (x, y) winds around 
the unit circle x* + y? = 1 in a counterclockwise direction. Thus the trace of ¢ is 
a spiral (called the circular helix) which winds around the cylinder x* + y? = 1 
(see Figure 13.6). As ¢t runs from 0 to 477, this spiral winds around the cylinder 
twice, and z runs from 0 to 47. Since @’(t) = (— sint, cost, 1), we have 


Ao 
[etas= | (1, cost, cost sint) - (— sint, cost, 1) dt 
Cc 0 


Ao 
=) (— sint + cos” ¢ + sint cost) dt = 27. ler 
0 


The following result shows that, unlike the line integral /, c 8ds, the oriented 
line integral {(. F-T ds can give different values for different smoothly equivalent 
parametrizations of the same curve. 
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FIGURE 13.6 


13.23 Remark. /f (@, 1) and (W, J) are smoothly equivalent but not orientation 
equivalent, then 


i F@() (0) dt = — [ F(p(u)) - W'(u) du. 


Proof. Lett be the transition from J to J. Since t’ is continuous and nonzero, 
it is either positive on J or negative on J. Since (@, /) and (w, J) are not 
orientation equivalent, it follows that t’ is negative on J; that is, |t’(u)| = 
—t'(u) for u € J. Combining this observation with the Change-of-Variables 
Formula (Theorem 12.46) and (1) in Section 13.1, we conclude that 


i F(¢(1)) -@(t) dt = | F(o(t(u)) - $'(e(w)) [r/(w)| du 


== [ Fy (u)) <u) du. m 


By the same method, we can show that the oriented integral (4) gives iden- 
tical values for orientation equivalent parametrizations of the same curve (see 
Exercise 13.2.5). Therefore, to evaluate an oriented integral over a curve C 
whose orientation has been described geometrically, we can use any smooth 
parametrization of C and adjust the sign of the integral to reflect the prescribed 
orientation. Here is a typical example. 


13.24 EXAMPLE. 
Find 


[eres 
Cc 


where F(x, y) = (y, xy) and C is the unit circle x* + y? = 1 oriented in the 
clockwise direction. 
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Solution. The parametrization @(t) = (cost, sint), t € [0, 27], of C has coun- 
terclockwise orientation (see Example 13.4). Thus, by Remark 13.23, 


20 
if F-Tds = -| (sin ft, sint cost) - (— sint, cost) dt 
Cc 0 
20 
= / (sin’ t — sint cos” t)dt=n7. a 
0 


There is another way to represent the oriented integral (4) which uses differ- 
ential notation. Recall that if x; = $;(t), then dx; = o; (t) dt. Hence, formally, 


F(@(t)) - ¢’(t) dt looks like 
(Fy (b(t)) 0) (0) +--+ + Fin(@(t)) oy, (t)) dt = Fy dx, +--+ + Fn dxm. 


This last expression is called a differential form of degree 1 on R™ (more briefly, 
a 1-form) and the functions F; are called its coefficients. A 1-form is said to 
be continuous on a set E if and only if each of its coefficients is continuous 
on E. The oriented integral of a continuous 1-form on a smooth arc C in R” is 
defined by 


[Fiat t+ Fndani= fF Tas, 
Cc Cc 


where F = (F\,..., Fin). 

The following example illustrates the fact that differential forms provide a 
shorthand for the way an oriented line integral is computed (so we can avoid 
parametrizing explicit curves). 


13.25 EXAMPLE. 
Find 


/ y dx +cosx dy, 
Cc 


where C is the explicit curve y = x* + sinx oriented from (0, 0) to (, 2). 


Solution. Since y = x* + sinx and dy = (2x + cos x) dx, we have 


Tv 


us 
[vax teosxay = [ G2 +sinx) dx + [ cos x (2x + cos x) dx 
C 0 0 


ie Uv 


= $529, rs 
FD 


Let C = eee , Cj; be a piecewise smooth arc in R” (see the discussion pre- 
ceding Example 13.16) and T; be a unit tangent vector for C;. If F: C > R” is 
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continuous, then the oriented line integral of F along C induced by the tangents 
T; is defined to be 


N 
[era=y f F-T; ds. 
Cc ja °C , 


If w is a 1-form continuous on C, then the oriented integral of w along Cis defined 


to be 
N 
o=y fo. 
[ jae 


13.26 EXAMPLE. 
Find 
I xy dx + (x? + y’) dy, 
Cc 


where C is the boundary of Q = [0, 1] x [0, 1] oriented in the counterclockwise 
direction. 


Solution. The boundary C = 0@Q consists of four smooth pieces (see 
Figure 13.7): C; (which lies in the line x = 0), C2 (in y = 0), C3 (in x = 1), 
and C4 (in y = 1). For Cj, let x = 0 and y run from 1 to 0 (to maintain counter- 
clockwise orientation on C). Then 


0 
1 
/ xydx +02 +y*) dy = | y? dy =—-. 
Ci 1 3 


Similarly, the integrals over C2, C3, and C4 are 0, 4/3, and —1/2. Hence, 


1 4 1 1 
[¥ Tas = +QO- =n. a 
e 3 a3 9 2 
Ay 
1 =— Cy 


w 


AS 
an A 


FIGURE 13.7 


556 Chapter 13 


Fundamental Theorems of Vector Calculus 


EXERCISES 


13.2.1. 


13.2.2. 


13.2.3. 


13.2.4. 


13.2.5. 


For each of the following, sketch the trace of (¢, R), describe its orien- 
tation, and verify that it is a subset of the surface S. 


a) p(t) = Gt, 3sint, cost), S = {(x, y,z): y7 +927 =9} 
VOSS. Fy Sain. = 5) 

c) d(t) = (t, 27, sint), S={(x, y,2):y = x7} 

d) @(t) = (cost, sint, cost), S = {(x, y, z): y? 472 = 1} 
e) @(t) = (sint, sint,t), S = {(x, y,z): y =x} 


For each of the following, find a (piecewise) smooth parametrization 
of C and compute {. F-Tds. 


a) C is the curve y = x? from (1, 1) to (3, 9), and F(x, y) = (xy, y—x). 
b) C is the intersection of the elliptical cylinder y? + 2z7 = 1 with 
the plane x = —l1, oriented in the counterclockwise direction 
when viewed from far out the positive x-axis, and F(x, y,z) = 


(/x3 + y3 +5, z, x7). 


c) C is the intersection of the bent plane y = |x| with the elliptical 


cylinder x* + 3z* = 1, oriented in the clockwise direction when 
viewed from far out the positive y-axis, and F(x, y,z) = (z, -Z, 
x+y). 


For each of the following, compute /¢ w. 


a) C is the polygonal path consisting of the line segment from (1,1) 
to (2,1) followed by the line segment from (2,1) to (2,3), and w = 
ydx +x dy. 

b) C is the intersection of z = x? + y? and x* + y* + 2? = 1, oriented 
in the counterclockwise direction when viewed from high up the 
positive z-axis, and w = dx + (x + y) dy + (x* +-.xy + y?) dz. 

c) C is the boundary of the rectangle R = [a, b] x [c,d], oriented in 
the counterclockwise direction, and w = xydx + (x + y)dy. 

d) C is the intersection of y = x and y = z”,0 < z < 1, oriented 
from left to right when viewed from far out the y-axis, and w = 
/x dx + cos ydy — dz. 


a) Letc € R, 5 > 0, and set t(u) = 6u +c foru € R. Prove that if 
(@, I) is a smooth parametrization of some curve, if J = t~!(J), 
and if wy = @ot, then (¥, J) is orientation equivalent to (@, J). 

b) Prove that if (¢,/) is a parametrization of some smooth arc, 
then it has an orientation equivalent parametrization of the form 
(Wr, [0, 1). 


c) Obtain an analogue of b) for piecewise smooth curves. 


Let (¢, I) be a smooth parametrization of some arc and 1 be a C! 
function, 1-1 from J onto J, which satisfies t’(u) > 0 for all but finitely 
many u € J. If y = ¢ oT, prove that 


13.2.6 |. 


13.2.7. 


*13.2.8. 
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i F(g(t)) -¢(t) dt = / F(p(u)) - #'(u) du 


for any continuous F : @(J) > R”. 

This exercise is used in Section 13.5. Let f : [a,b] ~ R be C! on [a, b] 
with f’(t) 4 0 fort € [a, b]. Prove that the explicit curve x = fo Os 
as y runs from f(a) to f(b), is orientation equivalent to the explicit 
curve y = f(x), as x runs from a to b. 

Let V 4 @ be open in R*. A function F : V — R? is said to be 
conservative on V if and only if there is a function f : V — R such 
that F = Vf on V. Let (x, y) € V and let F = (P,Q): V > R’ be 
continuous on V. 


a) Suppose that C(x) is a horizontal line segment terminating at (x, y); 
that is, a line segment of the form L((x1, y); (x, y)), oriented from 
(x1, y) to (x, y). If C(x) is a subset of V, prove that 


i) 


— F-.T ds = P(x, y). 

Ox C(x) 
Make and prove a similar statement for 0/dy and vertical line seg- 
ments in V terminating at (x, y). 

b) Let (xo, yo) € V. Prove that 


(*) [¥ tas =0 
Cc 


for all closed piecewise smooth curves C Cc V if and only if for all 
(x, y) € V, the integrals 


fx. yk= f F.T ds 
Ca@,y) 


give the same value for all piecewise smooth curves C(x, y) which 
start at (xo, yo), end at (x, y), and stay inside V. 

c) Prove that F is conservative on V if and only if (*) holds for all 
closed piecewise smooth curves C which are subsets of V. 

d) Prove that if F is C! and satisfies (*) for all closed piecewise smooth 
curves C which are subsets of V, then 


dP dQ 

dy dx 
Note: If V is nice enough, the converse of this statement also holds (see 
Exercise 13.6.8). 


Suppose that f : [0,1] — R is increasing and continuously differen- 
tiable on [0, 1]. Let T be the right triangle whose vertices are (0, f(0)), 
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(1, f(0)), and (1, f(1)). If c represents the hypotenuse of T, a and b 
represent the legs of T, and L represents the arc length of the explicit 
curve y = f(x), x € [0, 1], prove thatc < L<a+b. 


13.3 SURFACES 


In this section we define surfaces and unoriented surface integrals, concepts 
which are two-dimensional analogues of arcs and the line integrals discussed 
in Section 13.1. Recall that a smooth arc is parametrized on a closed, bounded 
interval. On what shall we parametrize a smooth surface? Evidently, we need to 
use some type of closed, bounded set in R*. Although we could use rectangles, 
such a restriction would be awkward when dealing with explicit surfaces with 
curved projections, or with surfaces described by cylindrical or spherical coor- 
dinates. It is much more efficient to build greater generality into the definition 
of surface, using two-dimensional regions instead of rectangles (i.e., using sets of 
the following type for m = 2). 


13.27 Definition. 


An m-dimensional region is a set E C R” such that E = V for some nonempty, 
open, connected Jordan region V in R”. 


Notice that every closed, bounded interval is a one-dimensional region, ev- 
ery two-dimensional rectangle and the closure of every two-dimensional ball or 
ellipse is a two-dimensional region, and every three-dimensional rectangle and 
the closure of every three-dimensional ball or ellipsoid is a three-dimensional 
region. 


13.28 Definition. 


A subset S of R? is called a C? surface (in R*) if and only if there is a pair 
(@, E) such that E is a two-dimensional region, @ : E > R? is C? on E and 


1-1 on E°, and S = @(£). In this case we call (¢, E) a parametrization of S, S 
the trace of (@, E), and the equations 


x= (u,v), y=@U,v), 2Z=¢3(u, v), (u,v) cE 


the parametric equations of S induced by (@, £). 


Earlier, we called the graph of a function z = f(x, y) a surface. The follow- 
ing result shows that this designation is compatible with Definition 13.28 when 


fisC?. 


13.29 EXAMPLE. 


Let EF be a two-dimensional region and let f : E > Rbe aC? function. Prove 
that the graph of z = f(x, y) isaC? surface. 


Section 13.3 Surfaces 559 


Proof. If é(u,v) = (u,v, f(u,v)), then @ is C? and 1-1 on E, and @(£) is 
the graph of z = f(x,y). [This is called the trivial parametrization of z = 
FQ, y).] | 


In a similar way we define trivial parametrizations of surfaces of the form x = 
f(@,z) and y = f(x, z). For example, the trivial parametrization of the surface 
x = f(y,z), (yz) € E, is given by (¢, EF), where d(u, v) = (f(u, v), u,v). By 
an explicit surface over E we shall mean a surface of the form x = f(y,z), y= 
f(x,z), or z = f(x,y), where f : E > RisaC? function and E is a two- 
dimensional region. By the proof of Example 13.29, every explicit surface is a 
C? surface. 

The next four examples, which provide model parametrizations for certain 
kinds of surfaces, show that not every surface is an explicit surface. 


13.30 EXAMPLE. 


Show that the truncated cylinder Paes y? =1, 0<z<2,isaC@™ surface. 


Proof. Let $(u, v) = (cos u, sinu, v) and E = [0, 2s] x [0, 2], and notice that 
@ is 1-1 on E° and C™ on E. The corresponding parametric equations are 
x =cosu, y = sinu, z = v. Clearly, x? + y* = 1. Thus Q(£) is a subset of 
the cylinder x*+y? =1, 0 <z <2. Since E is connected, so is #(E). To see 
that @(£) is the entire cylinder, look at the images of horizontal line segments 
in E. The image of the line segment v = vg is a circle lying in the plane z = vo, 
centered at (0, 0, vo), of radius 1 (see Figure 13.8). Thus, as vp ranges from 0 to 
2, the images of horizontal lines v = vo cover the entire cylinder x* + y? = 1, 
O<z <2, a 


FIGURE 13.8 


13.31 EXAMPLE. 


Show that the sphere x? + y? + z* = a? isa C™ surface. 


Proof. Let $(u,v) = (acosucosv,asinucosv,asinv) and E = [0,27] x 
[—1/2, 2/2]. Clearly, ¢ is CC on E. The corresponding parametric equations 
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are Xx = acosucosv, y = asinucosv, z = asinv. Since xe y" = a’ cos? v, 
we have x7 + y* +2? = a?. Thus @(£) is a subset of the sphere centered at the 
origin of radius a. The image of the horizontal line segment v = vg is a cir- 
cle, lying in the plane z = asin vg, centered at (0, 0, asin vg) of radius a cos v9 
(see Figure 13.9). The image of the top edge (respectively, bottom edge) of 
E (i.e., of the horizontal line v = 2/2) (respectively, v = —z/2) is the north 
pole (0, 0, a) [respectively, the south pole (0, 0, —a)]. Thus, as vo ranges from 
—m/2 to 1/2, the images of horizontal lines v = vg cover the entire sphere 
P4y42=aq?. a 


(0, 1/2) (21, 1/2) 


(0, — 7/2) (2m, —1/2) 


(a, 0, 0) " ' (0, a, 0) 


FIGURE 13.9 


Let C represent the circle in the xz-plane centered at (a, 0,0) of radius b, 
where a > b. The torus centered at the origin with radii a > b is the donut- 
shaped surface obtained by revolving C about the z axis (see Figure 13.10). 


(-a, m) (1, 7) 
= | 
Va Vo 3 

) 

u=07 | 
| 

(-m, — 7) (x, -7) 

x ¥ 
(a+b, 0,0) (0,a+b,0) 
FIGURE 13.10 


13.32 EXAMPLE. 


Show that the torus centered at the origin with radii a > b is aC™ surface. 


Proof. Let d(u, v) = ((a + bcosv) cosu, (a + bcos v) sinu, bsinv) and E = 
[—z, 2] x [—7, x], and notice that @ is 1-1 on E° and C™ on E. The image of 
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u = Ois a circle in the xz-plane centered at (a, 0, 0) of radius b. The images 
of horizontal lines v = vg are circles, parallel to the xy-plane, centered at 
(0, 0, b sin vg) of radius (a + bcos vp). The image of the lines v = +7 is a circle 
in the xy-plane centered at (0,0, 0) of radius a — b. Thus $(£) covers the 
entire torus. | 


13.33 EXAMPLE. 


Let b > 0. Show that the truncated cone z = /x2+y?, 0 < z < b,isaC® 
surface. 


Proof. Let (x, y, z) = @(u, v) = (vcos u, vsinu, v) and E = (0, 27] x [0, d], 
and notice that @ is 1-1 on E° and C® on E. Clearly, x* + y? = z? and 
0 <z <b. Thus $(£) is a subset of the given cone. The image of a horizontal 
line v = vo, 0 < vo < b, isa circle in the plane z = vp centered at (0, 0, vo) of 
radius vo (see Figure 13.11). Thus $(£) is the cone z = /x2 + y2,0<z <b. 
Notice that the image of the line v = 0 is the vertex (0, 0, 0). a 


(0, b) (20, b) 


vey, 
ON 


(0, 0) (2x, 0) 


FIGURE 13.11 


Let S be aC? surface with parametrization (@, EF), and suppose that (uo, vo) € 
E°. If 6 = (¢1, ¢2, $3), then by the Implicit Function Theorem (see the proof 
of Remark 13.10), we can show that if at least one of the partial Jacobians is 
nonzero at (uo, vg) —that is, if 


0(¢;, dj 
A ¢;,¢; U0, VO) = oO uy, vo) # (0, 0, 0) (5) 


for some i # j-—then there is a C? explicit surface (fw, B) such that 
(xo, Yo, Z0) := @(uo, vo) € WCB) and W(B) Cc @(E). Since differentiable ex- 
plicit surfaces have tangent planes (see Theorem 11.22), it follows that if (5) is 
satisfied for some i # j and (xo, yo, Zo) = (uo, vo), then S has a tangent plane 
at (x0, yo, Z0). 
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The following result shows how to use a parametrization of a surface to com- 
pute a normal to its tangent plane. 


13.34 Remark. Let S be aC? surface, let (¢, E) be one of its parametrizations, 
and set @ =: (¢1, $2, $3). If (5) holds at some (uo, vo) € E° and some i 4 j, then 
anormal to the tangent plane of S at (xo, yo, Zo) = (Uo, vo) is given by 


(by, X $,)(Uo, Vo) = (Agy,¢3 (Uo, V0), Ags,¢1 (Uo, V0), Agi,¢2(Uo, Vo))- (6) 


Proof. Let IT be the tangent plane to S at ¢(uo, vo). To compute a normal 
to IT we need only find two vectors which lie in 7. But @,,(uo, vo) is tan- 
gent to the curve $(u, vo) and ¢,(uo, vo) is tangent to the curve $(uo, v) (see 
Figure 13.3). Hence, @,,(uo, vo) and @,,(uo, vo) both lie in TI (see Figure 13.12). 
Therefore, a normal to I at (xo, yo, Zo) is given by the cross product 


(Uo, vo) X Py (Uo, Vo) = (Agy,¢; (Uo. Vo), Ags,¢, (U0. V0), Ad, ,¢) (U0, vo). =< 


No 
tangent plane 


gE) 


FIGURE 13.12 


If (@, E) is a parametrization of a C! surface S, we shall use the notation 
Nou, v) := 6, (4, v) x by, v), (u,v) EE, 


to represent the vector (6). When (5) holds for some i # j, we shall call 
Ng (uo, vo) the normal induced by ¢ on S. It is easy to check that if z = f(x, y) is 
an explicit surface and ¢ is its trivial parametrization, then Ng = (— fx, —fy, 1). 
This is equivalent to the normal we used for explicit surfaces before (see 
Theorem 11.22). 

Normal vectors play the same role for surfaces that tangent vectors played 
for curves. (For example, we shall use normal vectors to define area of surfaces, 
smooth surfaces, and orientation of surfaces.) Indeed, many of the concepts for 
curves can be brought over to surfaces by replacing ¢’ by Ng. For example, 
compare the following definition with Definition 13.11. 
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13.35 Definition. 
Suppose that (¢, £) is a parametrization of a C? surface. 


i) (@, E) is said to be smooth at a point (ug, vo) € E if and only if Ng(uo, 
vo) # 0 [equivalently, if and only if || Ng (uo, vo)|| > 0]. 
ii) (¢, E) is said to be smooth if and only if it is smooth at each point in E. 
iii) (@, E) is said to be smooth off a set Ep C E if and only if (¢, E) is smooth 
at each point in E \ Eo. 


Notice that the trivial parametrization of an explicit surface is always smooth. 

Analogous to the situation for curves, a surface with a smooth parametriza- 
tion must have a tangent plane at each of its points (see Exercise 13.3.7). On 
the other hand, a surface with tangent planes at each point can have nonsmooth 
parametrizations. For example, the parametrization @ of the sphere given in 
Example 13.31 satisfies 


Nell = II (a? COs u COS” v, a” sin u cos” v, a” sin v cos v)|| = a’| cos v| 


and, hence, is not smooth when v = +7/2. (This happens because this 
parametrization takes the lines v = +7/2 to the north and south pole, and, 
hence, is not 1-1 there.) 

We shall call a surface S smooth if and only if for each point xo € S there is 
a parametrization (¢, E) of S which is smooth at (uo, vo), where xo = (uo, vo). 
Other authors call a surface smooth only when it has a smooth parametrization. 
This definition is inadequate for most “closed” surfaces (i.e., surfaces which are 
the boundary of some three-dimensional region) because those surfaces have no 
(globally) smooth parametrizations. (See, e.g., discussion of the parametrization 
of the sphere in the preceding paragraph. The sphere IS smooth by our defini- 
tion, however, since we can find other parametrizations which are “smooth” 
at the north and south poles, for example, the trivial parametrizations of each 
hemisphere.) This is typical. Every surface smooth by our definition is a union 
of surfaces with smooth parametrizations—see Exercise 13.4.7. 

The following result shows what happens to the normal vector Ng under a 
change of parameter. 


13.36 Theorem. Let (¢, E) and (W, B) be parametrizations of the same C? sur- 
face. If t is aC! function which takes B into E such that W = $ 0 t, then 


Ny(u, v) = Az u, v)Ng (tu, v)) 
for each u,v € B. 
Proof. Let = (¢1, ¢2, $3) and wW = (W1, W2, W3). By Remark 13.34, 


Ny = (Aqio,y3)> AWs.v1)» A.W) 
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Since, by hypothesis, (Wi, Wj) = (¢;,6;) 0 t fori, j = 1, 2,3, it follows from 
the Chain Rule that 


A(y;,wj)Us v) = Ar, VAG,,6)) (TU, v)) 
for any u,v € B. Therefore, Ny = Az - (Ngor) on B. g 


This leads us to the following definition (compare with Definition 13.13). 


13.37 Definition. 


Two C? parametrizations (@, EF), (w, B) are said to be smoothly equivalent if 
and only if they are smooth parametrizations of the same surface and there is 
a C? function t, which takes B onto E, such that wW = got and A;(u, v) £0 
for all (u, v) € B. The function 7 is called the transition from B to E. 


Analogous to Definitions 13.6 and 13.14, we define surface area and the sur- 
face integral as follows. 


13.38 Definition. 
Let S be asmooth C? surface and (@¢, E) be one of its parametrizations. 


i) The surface area of S is defined to be 
(y= | |Ng(u, v)|| d(u, v). 
E 


ii) If g : S > Ris continuous, then the surface integral of g on S is defined to 
be 


[[ eae [ ee. v)) Not, v) Il du, v). (7) 


The surface integral (7) can be interpreted as the mass of a membrane with 
shape $(£) and density g (see Appendix E). For an explicit C” surface S given 
by z= f(x, y), (, y) € E, this integral looks like 


| [sao =i) g(x, y/1+ £20, 9) + 2G.) d(x, y). (8) 
S E 


It can be argued on heuristic grounds that this is the right definition for 
surface area (see Appendix E). In fact, we could have defined the surface 
area of S by approximating it with planar regions, as we defined ||C|| below 
Example 13.16, by approximating it by line segments (see Price [10], p. 360). 
This approach, however, works only under suitable restrictions. Indeed, even 
when using triangular regions to approximate a bounded cylinder, the total area 
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of the approximating regions may become infinite (see Spivak [12], p. 130). We 
prefer Definition 13.381 because it is both direct and easy to use. 

Notice that by Theorem 12.24, (7) makes sense when the normal Ng(u, v) is 
undefined on a set of area zero. Thus the surface integral can be defined for 
some nonsmooth surfaces (e.g., for cones). 

It is easy to see that surface area and the surface integral are invariant un- 
der smoothly equivalent parametrizations, even when the condition A, #4 0 
is relaxed on a closed set of area zero (see Exercise 13.3.5). It is also easy to 
see that if a surface S is a subset of R*, then its surface area, as defined by 
Definition 13.38, is the same as the area of S, as defined by Definition 12.3 (see 
Exercise 13.3.4). 

To compute a surface integral, we must find a suitable parametrization of the 
given surface and apply Definition 13.38. 


13.39 EXAMPLE. 


Find f/f; g do, where S is the hemisphere zg = /a* — x? — y? and g(x, y,z) = 
Ji. 


Solution. Let @ be the function defined in Example 13.31 and E = [0, 27] 


[0, 7/2]. Then (@, E) is a parametrization of the hemisphere S and ||N¢|| = 


a’ cos v. Therefore, 


x 


m/2 A 
II gdo= II a’ cosvvVasinv du dv = ana’? f cos uv sinu dv = ae" 
S Eo 0 
| 


Continuity of g is assumed in Definition 13.38 only so that the integral on 
the right-hand side of (7) makes sense. If one of the iterated integrals is a con- 
vergent improper integral, we can extend the definition of the surface integral 
in the obvious way. Using this observation, we now offer a second solution to 
Example 13.39 using the trivial parametrization. 

ALTERNATE SOLUTION. The explicit surface z = a? — x? — y? has normal 
N = (—2x, —Zy, 1) = (x/z, y/z, 1). [This normal does not exist on 0B, (0, 0), but 
since 0B, (0, 0) is of area zero, we can ignore it when integrating over B,(0, 0).] 
Notice that on S, ||N|| = a/z. Thus, by (8) and polar coordinates, 


20 a 
4 
| {sao =f faa. =a / r(a* —r?)—'/4 dr dd = a7 gil, 
S Ba(0,0)  Z 0 Jo 3 


[The inner integral (with respect to r) is an improper integral.] | 


For even the simplest applications, we must have a theory rich enough to han- 
dle surfaces, like the boundary of the unit cube 0((0, 1] x [0, 1] x [0, 1]), which 
are not smooth but a union of smooth pieces. Consequently, we shall extend the 
theory developed above to finite unions of smooth surfaces. This expanded the- 
ory will be introduced using informal geometric descriptions instead of formal 
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statements. For now, these vague descriptions will suffice because the concrete 
surfaces which arise in practice are easy to visualize. (Spivak [12] contains a 
rigorous and more mathematically satisfying treatment of these ideas.) 

Before describing piecewise smooth surfaces, we must distinguish between in- 
terior points (points which lie “inside” a surface) and boundary points (points 
which lie on the “edge” of a surface). To illustrate the difference, consider 
the truncated cylinder S parametrized by (@, E) in Example 13.30. A point 
(x, y,z) € S lies inside S if 0 < z < 2, and on its edge if z = 0 or z = 2. (Look at 
Figure 13.8 to see why this terminology is appropriate.) Naively, we might guess 
that (x, y, z) lies on the edge of @(£) if and only if (x, y, z) ¢ @(E°). This guess is 
incorrect, even for the cylinder; for example, (1, 0, 1) = (0, 1) does not belong 
to @(E°) but does not belong to an edge of the cylinder either. (Instead, it lies 
on a “seam” of S.) Evidently, to define the interior and boundary of a general 
surface S, we must describe them geometrically. We cannot define the interior 
and boundary of a surface by using a particular parametrization (@, E). 

Accordingly, let S be a C? surface in R*. Imagine yourself standing on a point 
(x, y,z) € S. We shall say that (x, y, z) is interior to S if you are surrounded on 
all sides by points in S (i.e., if you take a sufficiently small step in any direction 
you remain on S). We shall denote the set of interior points of a surface S by 
Int(S) and shall define the (manifold) boundary of a surface S by 0S := S\Int(S). 

We have used the same notation to denote the boundary of a surface as we 
did to denote the boundary of a set (see Definition 9.16 or 10.28) even though 
these concepts are not the same. We made this choice because it homogenizes 
the statements of all the fundamental theorems of multidimensional calculus. To 
avoid ambiguity, we shall henceforth refer to the boundary of a region E (i.e., 
to E \ E°) as the topological boundary of E. No confusion will arise because 
the only boundary we use in connection with surfaces is the manifold boundary, 
and the only boundary we use in connection with m-dimensional regions is the 
topological boundary. 

A surface S is said to be closed if and only if dS = 9. For example, if a > 0, 
then the sphere x?+y*+2? = a’ is closed, but the hemisphere z = \/a2 — x? — y? 
(respectively, the truncated paraboloid z = x* + y*, 0 < z < 1) is not closed, 
since its boundary is x? + y* = a*, z = 0 (respectively, x7 + y* =1, z= 1). 

By the Jordan Curve Theorem, a closed arc C divides R* into two or more 
disjoint connected sets, the bounded components “surrounded” by C and the 
unbounded component which lies “outside” C. This is not the case for closed 
surfaces. Indeed, there are closed smooth surfaces (the Klein bottle is one ex- 
ample) which surround no points and, hence, do not divide R? into disjoint sets 
(see Hocking and Young [4], p. 237). 

A set S Cc R? will be called a piecewise smooth surface if and only if S = 
us j, Where each S; = ($;, Ej) is a smooth surface and for each j # k either 
S; 1 S_ is empty, or a portion of the boundary of $; is matched to a portion of 
the boundary of S;. Thus a piecewise smooth surface might consist of several 
disjoint, smooth surfaces, like the topological boundary of the corona0 <a < 
l|(x, y, Z)|| < b, or several connected pieces with ridges, like the concentric boxes 
a[({O, 3] x [0, 3] x [0, 3])\ (f1, 2] x [1, 2] x [1, 2])]. We make the further restriction 


Section 13.3 Surfaces 567 


that the intersection of any three S;’s is empty, or a finite set. This prevents a 
piecewise smooth surface from doubling back on itself more than once along 
any given edge. 

Let S= Ut 5; be a piecewise smooth surface. By a parametrization of S we 
mean a collection of smooth parametrizations (@ ;, E;) of S;. Two parametriza- 
tions (pj, Ej), (Wj. B;) are said to be smoothly equivalent if and only if (p;, Ej) 
is smoothly equivalent to (w;, Bj) for 7 = 1,..., N. The boundary, 0S, of S is 
defined to be the union of all points which belong to the closure of an unmatched 
portion of 0S;. (¢.g., the boundary of the box formed by removing the face z = 1 
from the unit cube [0, 1] x [0, 1] x [0, 1] is the unit square in the plane z = 1, and 
the boundary of the union of x? + y*? = 1, —3 < z <0,andz = /1 — x? — y? is 
the unit circle in the plane z = —3.) The surface area of S is defined by 


N 
o(S) =) \a(Sj) 
j=l 


and the surface integral of a real-valued function g continuous on S is defined by 


jfee7~ 2 ff ee 


13.40 EXAMPLE. 


Let S be the tetrahedron formed by taking the topological boundary of the re- 
gion bounded by x = 0, y = 0, z = 0, andx +y+z = 1. Find a piecewise 
smooth parametrization S and compute /'/, 5 8 do, where g(x, y,z) =x+ yreZ?. 


Solution. The tetrahedron has four faces which can be parametrized by 
o,(u,v) = (u,v,0), du,v) = (u,v), O3(u,v) = (u,0,v), d4(u,v) = 
(u,v, 1 —u — v), where (u, v) belongs to E, the triangular region with vertices 
(0, 0), (1, 0), and (0, 1). Since No, ll = 1 for j = 1,2,3 and ||Ng,|l = /3, 
we have 


1 l—u 1 l-u 
[fsao=[ w+ dvdu+ ff (u? + v*) dv du 
S 0 JO 0 JO 
1 l—-u 
+f i (u+v°) dv du 
0 JO 


1 l-u 
+v3f f (ut+tvu> +(1—u—v)3)) du du 
0 JO 


1 l—u 
-| i (2+ V3)u +0? + (1+ ¥3)0? + 207 
0 0 


+ V73(1 —u —v)3) dv du 
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: 1+ v3 
= i (2+ V¥3)u — (1+ V3)u* — uw? 4 ere u)? 
0 
2 3 
+2480 yh du 
3 
= —(2 3). a 
92+ ¥3) 
EXERCISES 
13.3.1. For each of the following, find the surface area of S. 


13.3.2. 


13.3.3. 


13.3.4. 


a) Sis the conical shell given by z = x? + y?, where a < z <b. 
b) Sis the sphere given in Example 13.31. 
c) Sis the torus given in Example 13.32. 


For each of the following, find a (piecewise) smooth parametrization of 
S and of 4S, and compute J'/, g do. 


a) S is the portion of the surface z = x* — y? which lies above the 
xy-plane and between the planes x = 1 and x = —1, and g(x, y,z) = 
V1+4x2 +4y?. 

b) Sis the surface y= x7, O< y <8, O<z<4,and g(x, y,z) = x°z. 

c) Sis the portion of the hemisphere z = \/9 — x” — y? which lies out- 
side the cylinder 2x? + 2y* = 9, and g(x, y,z) =x +y+z. 


Find a parametrization (@, E) of the ellipsoid 
2 2 2 
x y Z 
ZR a 


which is smooth off the topological boundary 0E. 
a) Suppose that E is a two-dimensional region and that S = {(x, y, z) € 
R? : (x, y) € E and z = 0}. Prove that 


Area (E) = II do 
Ss 
II gdo =f g(x, y,0) d(x, y) 
Ss E 


for each continuous g: E > R. 

b) Let f : [a,b] ~ R be aC? function, let C be the curve in R? de- 
termined by z = f(x), a < x < b, and let S be the surface in 
R? determined by z = f(x), a < x < b, c < y < d. Show that 
a(S) =(d—c)L(C). 

Let f : [a,b] ~ R be aC? function and let S be the surface obtained 
by revolving the curve y = f(x), a < x < b, about the x-axis. Prove 
that the surface area of S is 


and that 


Cc 


wa 
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b 
a(S) = an f IFO I+ 1S)? dx. 


13.3.5. Suppose that ¥(B) and @(£) are C? surfaces and that wy = ¢ oT, where 
t isaC! function from B onto Z. 


a) If (W, B) and (@¢, E) are smooth and rt is 1-1 with A, 4 0 on B, 
prove that 


[[ soe. einen. o re av= ff sve. 1))||Ny(s, 1)|| ds dt 


for all continuous g : (E) > R. 
*b) If Z is a closed subset of B of area zero such that (w, B) is smooth off 
Z, tis 1-1, and A; #0o0n B°\Z, prove that 


[[ owning. ool re av= [fev 1))||Ny(s, 1)I| ds dt 


for all continuous g : (£) > R. 


13.3.6. Suppose that f : B3(0, 0) — R is differentiable with ||V f(x, y)|| < 1 for 
all (x, y) € B3(0, 0). Prove that if S is the paraboloid 2z = ey? 0 
z <4, then 


II | f(x, y) — f (0, 0)| do < 40z. 
S 


13.3.7. Suppose that $(£) is aC? surface and that (xo, yo, zo) = (uo, vo), Where 
(uo, vo) € E°. If Ng(uo, vo) ¢ 0, prove that @(£) has a tangent plane at 
(xo, Yo, 20). 

13.3.8. Let w(B) be a smooth surface. Set E = |ly¥, ||, F = ¥, -W,, and G = 
lv, ||. Prove that the surface area of S is [, VE*G? — F7d(u, v). 

13.3.9. Suppose that S is a C! surface with parametrization (@, E) which is 
smooth at (xo, yo, Zo) = @(uo, vo). Let (W, J) be a parametrization of a G} 
curve in E which passes through the point (uo, vo) [i.e., there is a fo € J 
such that # (to) = (uo, vo)]. Prove that (@ o W)’ (to) - (b,, X ,) (Uo, Vo) = 0. 


13.4 ORIENTED SURFACES 


Recall that a smooth curve @(J/) is oriented by using the tangent vector @’(r) 
to choose a “positive direction.” Analogously, a smooth surface S$ = @(E) will 
be oriented by using the normal vector Ng to choose a “positive side.” Since 
smooth surfaces are by definition connected, such a choice will be possible if $ 
has two, and only two, sides. 

A new complication arises here. There are smooth surfaces which have only 
one side. (The following example of such a surface can be made out of paper by 
taking a long narrow strip by the narrow edges, twisting it once, and gluing the 
narrow edges together.) 
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13.41 EXAMPLE. [THE MOBIUS STRIP]. 


Sketch the trace of (@,E), where @(u,v) = ((2 + vsin(u/2))cosu, (2 + 
v sin(u/2)) sinu, vcos(u/2)) and FE = [—z, 2] x [—1, 1]. 


Solution. The image of the horizontal line v = 0 under @ is (2 cos u, 2 sinu, 0) 
(i.e., the circle in the xy-plane centered at the origin of radius 2). The image of 
each vertical line uw = uo is a line segment in R? which rotates through space 
as ug increases. For example, the image of u = 0 is (2,0, v), —1 < v < 1, and 
the image of u = +7 is the seam Sp := (—2 F v,0,0), —1 < v < 1; that is, 
the set of points {(x, 0,0) : —3 < x < —1}. Thus the trace of (@, E) is given in 
Figure 13.13. a 


(7, -1) 


(-2, -1) 


FIGURE 13.13 


To avoid such anomalies, we introduce the following concepts. The unit nor- 
mal of a smooth surface S, at a point (xo, yo, Zo) on S, induced by one of its 
parametrizations (@, E) is the vector n(x0, yo, z0) = N¢(uo, vo)/I|Ng (uo, vo), 
where (uo, vo) = (x0, yo, Zo). Evidently, the unit normal n is well defined 
only when 

Ng (uo, vo) No (ut, v1) 


INg(uo, vo) ll Ng (ur, v1) I 


for all (uj,vj) € E which satisfy @(u;,v;) = (xo, yo, zo) for 7 = 0,1. This 
will surely be the case if @ is 1-1 and smooth on E. If ¢ fails to be 1-1 on 
E, however, the unit normal n might not be well defined, even though (@, E) 
is smooth on E [see the Mobius strip above where ¢(z,v) = @(—z, v) but 
No(x, v) = —Ng(—7, v) for all v]. 

A smooth surface S$ is said to be orientable if and only if it has a smooth 
parametrization (@, E) which induces an unambiguous unit normal n on S that 
varies continuously over S; that is, if @(uo, vo) = @(u1, v1), then Ng (uo, vo) points 
in the same direction as Ng (uj, v1), and if (u2, v2) is near (uo, vo), then Ng (uz, v2) 
points in approximately the same direction as Ng(uo, vo). (A formal definition 
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of orientable can be found in Spivak [12].) If S is orientable, then its unit normal 
can be used to choose a “positive” side (the side from which n points). 

Henceforth, by a parametrization of an orientable surface S we mean a smooth 
(@, E) which induces an unambiguous unit normal on S. 


13.42 Definition. 


Two parametrizations (¢, E) and (, B) are said to be orientation equivalent if 
and only if they are parametrizations of the same orientable surface, smoothly 
equivalent with transition t, and A,(u, v) > 0 for all (u, v) € B. 


By Theorem 13.36, if (@, E) and (%, B) are orientation equivalent, then the 
normal vectors they generate point in the same direction. Thus the positive side 
chosen by (@, E) is the same as the positive side chosen by (w, B). 

Oriented surface integrals can be defined using the unit normal in the same 
way that oriented line integrals were defined using the unit tangent (compare 
the following definition with Definition 13.21). 


13.43 Definition. 


Let S be a smooth orientable surface with unit normal n determined by a 
parametrization (¢, E). fF: S > R? is continuous, then the oriented sur- 
face integral of F on S is 


[ [endo [ ody. 2)-Now.v) d(u, v). 
Ss E 


The notation of the left-most integral is consistent with the notation in (7) 
since n = Ng/||Ng|| and do = ||Ngl| du, v). 

Notice that the trivial parametrization always induces an unambiguous nor- 
mal on an explicit surface. In fact, if S = {(x, y,z):z = f(x, y), (x, y) € E}, 
Definition 13.43 takes the form 


[[# x40 = fF, 9 FED) Sin —fi eee (9) 


Things are not so simple for smooth surfaces which are the boundary of 
a three-dimensional region (like the sphere) and for surfaces which are not 
smooth (like the cone), because their parametrizations have at least one point 
where the normal is zero and, hence, the unit normal cannot be defined. Nev- 
ertheless, as was the case for the oriented line integral, the oriented surface 
integral can be defined when the normal fails to exist on some set of area zero 
(see Exercise 13.4.4). We need to be careful, however, with the definition of ori- 
entable. If the collection of nonsmooth points cuts across the entire surface (like 
the peak of a pup tent or the edge of a pyramid), we have difficulty defining what 
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it means to have a “continuously varying” normal. We shall address this prob- 
lem for piecewise smooth surfaces at the end of this section. In the meantime, 
notice that we can define what it means for a surface S = @(E) to be orientable 
if the set of singularities [i.e., the set of (x, y, z) € R° such that (x, y, z) = b(u, v) 
for some (u,v) € E which satisfies Ng(u,v) = 0] is finite. In particular, the 
standard parametrizations of spheres and cones can be used in Definition 13.43. 

What does an oriented surface integral represent? If F represents the flow of 
an incompressible fluid at points on a surface S, then F-n represents the normal 
component of F (i.e., the amount of fluid which flows in the direction of n) (see 
Appendix E). Thus the integral of F-n do on S, a measure of the flow of the 
fluid across the surface S in the direction of n, is sometimes called the flux of F 
across S. In particular, we should not be surprised when many of these integrals 
turn out to be zero. 

It is easy to see that the integral of F-n do ona surface S does not change 
when orientation equivalent parametrizations are used (see Exercise 13.4.4). 
The following result shows that a change of orientation changes the value of the 
oriented surface integral by a minus sign. 


13.44 Remark. /f (¢, E) and (W, B) are smoothly equivalent but not orientation 
equivalent, then 


[ Fou. v))- Ng(u, v) dtu, v) = - [ Foe, t))- Ny(s, t) d(s, ft). 


Proof. Let t be the transition from B to E. Since A, is continuous and 
nonzero on the connected set B, and (@, E) and (w, B) are not orientation 
equivalent, we have A; < 0 on B. Hence, it follows from Theorem 13.36 and 
Theorem 12.46 (the Change-of- Variables Formula) that 


[Fae, t))- Ny(s, t) d(s,t) = =f |Az(s, 1)|((Fogot)(s,t)- (Ngo T)(s, f) 
--f F(o(u, v)) - Ng(u, v) dtu, v) 
1(B) 
— -| F(g(u, v)) - Ng(u, v) dtu, v). | 
E 


Therefore, when evaluating an oriented integral on a surface S whose orienta- 
tion has been described geometrically, we can use any smooth parametrization 
of S and adjust the sign of the integral to reflect the prescribed orientation. Here 
is a typical example. 


13.45 EXAMPLE. 


Find the value of [fF -n do, where F(x, y, z) = (xy, x — y,z), S is the planar 
regionx +y+z=1, (x,y) € [0,1] x [0, 1], and n is the downward-pointing 
normal. 
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Solution. The usual normal (1, 1, 1) of the plane x + y + z = 1 points upward 
rather than downward. Thus, by Remark 13.44, 


1 1 
1 
[[enao=- fof cys-yt-a-y G1. naray= 3. a 
s 0 JO 4 


It is convenient to have a “differential” version of oriented surface integrals. 
To see how to define differentials of degree 2, let S = @(E) be a smooth ori- 
entable surface and x = ¢i (u,v), y = d2(u, v), z = $3(u, v). By Remark 13.34, 


(2 d(Z, x) 2) 
(u,v) A(u, v)’ A(u, v) J” 


Therefore, the oriented surface integral of a function F = (P, Q, R) : @(E) > 
R? has the form 


i (? COE) gps MD ape a2) Ae) 
E d(u, v) O(u, Vv) d(u, Vv) 


= ff Pay dz+ Qdzdx + Raxdy 
S 


that is, we should define differentials of degree 2 by 


a(y, Z) 0(z, x) a(x, y) 
aes | oy ade Ge aa 
TOG ea tT EO Gay, oe ROD tare an 


d(u, v). 


[These are two-dimensional analogues of the differential dy = f’(x)dx.] Bya 
2-form (or a differential form of degree 2) on a set Q C R* we mean an expres- 
sion of the form 


Pdydz+Qdzdx+Rdx dy, 


where P,Q, R: Q > R. A 2-form is said to be continuous on Q if and only if its 
coefficients P, Q, R are continuous on Q. The oriented integral of a continuous 
2-form on a smooth surface S oriented with a unit normal n is defined by 


[fp ayaz+ Qdzas+ Raxdy = [f[(P,0, 8) nab. 
S S 


Differential forms of degree 1 were formal devices used in certain computa- 
tions (e.g., to compute an oriented line integral or to estimate the increment of a 
function). Similarly, differential forms of degree 2 are formal devices which will 
be used in certain computations (e.g., to compute an oriented surface integral). 
They can also be used to unify the three fundamental theorems of vector calcu- 
lus presented in the next two sections (see Spivak [12]). There is a less formal 
way to introduce differentials in which the differential dx can be interpreted as 
the derivative of the projection operator (x, y, z) H+ x (see Spivak [12], p. 89). 
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In general, the boundary of a surface is a curve. Since the boundary of the 
MObius strip is a simple closed curve, the boundary of a surface may be ori- 
entable even when the surface is not. 

Suppose that S is an oriented surface and that 0S is a piecewise smooth curve. 
The orientation of S can be used to induce an orientation on 0S in the following 
way. Imagine yourself standing close to 0S on the positive side of S. The direc- 
tion of positive flow on 0S moves from right to left (1.e., as you walk around the 
boundary on the positive side of S in the direction of positive flow, the surface 
lies on your left). This orientation of 0S is called the positive orientation, the 
right-hand orientation, or the orientation on 0S induced by the orientation of S. 
When S is a subset of R? (i.e., of the xy-plane), we shall say that 4S is oriented 
positively if it carries the orientation induced by the upward pointing normal on 
S (i.e., the normal which points toward the upper half space z > 0). Thus if 
S is a bounded subset of R* whose boundary is a connected piecewise smooth 
closed curve, then the usual orientation on S induces a counterclockwise orien- 
tation on 0S when viewed from high up on the positive z-axis. This is not the 
case, however, when E has interior “holes.” For example, if E = {(x, y) : a < 
x? + y* < b?} for some a > 0, then the positive orientation is counterclockwise 
on {(x, y) : x7 + y? = b*} but clockwise on {(x, y) : x7 + y* = a’}. This informal 
geometric description is sufficient to identify the induced orientation in most 
concrete situations. Here is a typical example. 


13.46 EXAMPLE. 


Let S be the truncated paraboloid z = Ry 0 <z < 4, with outward-pointing 
normal. Parametrize 0S with positive orientation. 


Solution. The boundary of S is the circle x* + y* = 4 which lies in the z = 
4 plane. The positive orientation is clockwise when viewed from high up the 
z-axis. Therefore, a parametrization of 0S is given by @(t) = (2sint, 2 cost, 4), 
t € [0, 27]. ie 


How do we extend these ideas to piecewise smooth surfaces? If S$ = US;, 
it is not enough to assume that each S$; is orientable, because the MObius strip 
is the union of two orientable surfaces, namely $(£,) and @(£2), where @¢ is 
given by Example 13.41 and Ex, = [m(k — 2),a(k — 1)] x [-1, 1], & = 1,2. 
We shall say that a piecewise smooth surface $ = US; is orientable if and only 
if one can use the normals +Ng, to generate a unit normal nj on each piece 
S; which identifies the “positive side” in a consistent way (e.g., all normals on 
one connected piece point outward and all normals on another connected piece 
point inward). If S = UE 5; 1S orientable, then the oriented surface integral of 


a continuous function F : § > R? is defined to be 


N 
[[ena=> ff F «nj do. 
Ss = Sj 


The following three examples provide further explanation of these ideas. 
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13.47 EXAMPLE. 


Evaluate 


[[ ena. 
S 


where S is the topological boundary of the solid bounded by the cylinder x? + 
y* = 1 and the planes z = 0, z = 2; nis the outward-pointing normal; and 
F(x, y; Z) => (x, 0, y). 


FIGURE 13.14 


Solution. This surface has three smooth pieces: a vertical side $;, a bottom Sp, 
and a top 53 (see Figure 13.14). Parametrize S; by @(u, v) = (cosu, sinu, v), 
where E = [0, 277] x [0, 2]. Thus Ng = (cosy, sinu, 0) and 


2 20 
II F-ndo = | / cos* u du dv = 2n. 
Sy 0 JO 


Since the outward-pointing unit normal to S; isn = (0,0,—1), we see by 
Exercise 13.3.4a that 


20 1 
II F-ndo =~ [ yate.y=— f [ Psino ar ao =o. 
So B, (0,0) 0 0 


Similarly, the integral on S3 is also zero. Therefore, 


[ [endo = 20 4040-28, a) 
S 
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13.48 EXAMPLE. 


Find //, F-ndo, where F(x, y, z) = («+ z7,x,2z), Sis the topological boundary 
of the solid bounded by the paraboloid z = x? + y* and the plane z = 1, and nis 
the outward-pointing normal. 


Solution. The surface S has two smooth pieces: the paraboloid S; given by z = 
x7 +4 ye 0 < z < 1, and the disk S> given by x7 + y? <1, z= 1. The trivial 
parametrization of S; is @(u,v) = (u,v, u? + v*), (u,v) € B,(0,0). Note that 


Ng = (—2u, —2v, 1) points inward (the wrong way). Thus, by Remark 13.44 and 
polar coordinates, 


II F-ndo = -| (—2u* — 2u(u* + v*)? — 2uv + (u? + v*)) dtu, v) 
Sy By (0,0) 
1 20 
— i / (2r? cos’ 6 + 2r°> cos 6 + 2r* cos 6 siné — r’)r dé dr = 0. 
0 JO 


Since the unit outward-pointing normal of Sz isn = (0,0, 1) andF-n=z= 1 
on Sz, we see by Exercise 13.3.4a that 


II F-ndo = | d(x, y) = Area(B, (0, 0)) =z. 
So B, (0,0) 


[ [Pra =04+0=7. |_| 
S 
13.49 EXAMPLE. 


Compute f/f, F-ndo, where F(x, y, z) = (x, y, z); Sis the topological boundary 
of the solid bounded by the hyperboloid of one sheet x? + y? — z? = 1 and the 
planes z = —1, z = V3; and nis the outward-pointing normal to S. 


Therefore, 


Solution. The surface S has three smooth pieces: a top S}, a side S,, and a 
bottom S3 (see Figure 13.15). Using n = (0, 0, 1) for S;, we have 


II F-ndo = V3 d(x, y) = 4V3n. 
S Bz (0,0) 


// F-ndo = 2n. 
S3 


To integrate F-non 5), let z = u and note that x* 4+ y* =1+u?. Thus (u, v) = 
((1 + u?)cos v, (1 +u7)sinv, uv), (u,v) € [—1, V3] x [0, 27], is a parametrization 
of Sz. Since Ng = (—(1+ u?)cos v, —(1 +u2)sinv, 2u(1+u7)) points inward and 


Similarly, 
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Ae Ss; 


(0,0,V3) @ 


S; 0, -1) 


FIGURE 13.15 


F-Ng = (1+ u’)cos v, (1 + u*)sin v, u)- 
-(-( +. u?)cos v, —(1 +.u?)sin v, 2u(1 + u?)) 
= —(1+u*)? + 2v?(1 +0) =u4 - 1, 


we have 
J3 p2r 
II F-ndo =~ [ / (u* — 1) dv du 
So —1 0 
v3 8 
=2 f Ga ae = (1 — V3). 
-1 
Therefore, 
8 6 
[ [endo = avin 420 + Za - V5 = Zo +25. | 
S 
EXERCISES 


13.4.1. For each of the following, find a (piecewise) smooth parametrization of 
aS which agrees with the induced orientation, and compute /,, F - Tds. 


a) S is the truncated paraboloid y = 9 — x? — 2?, y > 0, with outward- 
pointing normal, and F(x, y, z) = (x7y, y*x, x ty +2). 

b) Sis the portion of the plane x + 2y + z = 1 which lies in the first oc- 
tant, with normal which points away from the origin, and F(x, y, z) = 
C= 7.7 282"). 

c) Sis the truncated paraboloid z = x* + y*, 1 < z < 4, with outward- 
pointing normal, and F(x, y,z) = (5y + cosz, 4x — sinz, 3xcosz + 
2y sin z). 
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For each of the following, compute //, F-ndo. 


a) Sis the truncated paraboloid z = x7+ y*, 0 < z < 1, mis the outward- 
pointing normal, and F(x, y, z) = (x, y, z). 

b) Sis the truncated half-cylinder z = \/4 — y?, 0 < x < 1, mis outward- 
pointing normal, and F(x, y, z) = (x2 + ye yZ, 2°), 

c) S is the torus in Example 13.32, n is the outward-pointing normal, 
and F(x, y, z) = (y, —x, Zz). 

d) Sis the portion of z = x” which lies inside the cylinder x? + y* = I,n 
is the upward-pointing normal, and F(x, y, z) = (y?z, cos(2 + log(2 — 
47 = '97)), 8°): 

For each of the following, compute /f; w. 


a) S is the portion of the surface z = x* + y* which lies over the unit 
square [0, 1] x [0, 1], with upward-pointing normal, and w = x dy dz+ 
ydzdx+zdxdy. 

b) Sis the upper hemisphere z = \/a? — x? — y?, with outward-pointing 
normal, and w = xdydz+ ydzdx. 

c) Sis the spherical cap z = ,/a? — x? — y? which lies inside the cylinder 
x? 4 y* = b*, 0 < b < a, with upward-pointing normal, and w = 
xzdydz+dzdx+zdx dy. 

d) Sis the truncated cone z = 2,/x? + y?, 0 < z < 2, with normal which 
points away from the z-axis, and w = x dy dz + ydz dx + z* dx dy. 


Suppose that w(B) and ¢(£) are C? surfaces and that » = ¢ oT, where 
t isaC! function from B onto E. 


a) If (W, B) and (¢, E) are smooth, and 7 is 1-1 with A; > 0 on B, 
prove for all continuous F : (£) > R? that 


[ Fou. v))- Nou, v) dtu, v) = [ Fav. t))- Ny(s, t) d(s, t). 


*b) Suppose that Z is a closed subset of B of area zero, that (w, B) is 
smooth off Z, and that t is 1-1 with A; > 0 on B°\Z. Prove for all 
continuous F : (E) — R? that 


[ Fou. v))- Ng(u, v) dtu, v) = [ Fav. t))- Ny(s, t) d(s, t). 


Let E be the solid tetrahedron bounded by x = 0, y = 0, z = 0, and 
x +y-+z = 1, and suppose that its topological boundary, T = dE, 
is oriented with outward-pointing normal. Prove for all C! functions 
P,O,R:E — R that 


II P dy dz+ Qdzdx+Rdxdy= ff (Py + Oy + Rz) dV. 
aE E 
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13.4.6. Let T be the topological boundary of the tetrahedron in Exercise 13.4.5, 
with outward-pointing normal, and S be the surface obtained by taking 
away the slanted face from T (i.e., S has three triangular faces, one each 
in the planes x = 0, y= 0, z = 0). If 0S is oriented positively, prove for 
all C! functions P, 0, R: S > R that 


i Pdx+Qdy+Rdz= | [ (Ry ~ 0.) dy dz (P.~ Ry) dz dx 
as S 
+ (Qx — Py) dx dy. 


13.4.7. Suppose that S is a smooth surface. 


a) Show that there exist smooth parametrizations (¢;, Ej) of portions 
of S such that S$ = Ui (Ej). 

b) Show that there exist nonoverlapping surfaces S$; with smooth 
parametrizations such that S = US j- What happens if S is 
orientable? 


13.5 THEOREMS OF GREEN AND GAUSS 
Recall by the Fundamental Theorem of Calculus that if f is aC! function, then 


b 
fo) - fay =f foat. 


Thus the integral of the derivative f’ on [a, b] is completely determined by the 
values f takes on the topological boundary {a, b} of [a, b]. 

In the next two sections we shall obtain analogues of this theorem for func- 
tions F : Q > R”, where Q is a surface or an m-dimensional region, m = 2 or 3. 
Namely, we shall show that the integral of a “derivative” of F on Q is completely 
determined by the values F takes on the “boundary” of Q. Which “derivative” 
and “boundary” we use depends on whether © is a surface or an m-dimensional 
region and whether m = 2 or 3. 

Our first fundamental theorem applies to two-dimensional regions in 
the plane. 


13.50 Theorem. [GREEN’S THEOREM]. 
Let E be a two-dimensional region whose topological boundary dE is a piece- 
wise smooth C! curve oriented positively. If P,Q : E — R are C! and 


F = (P, Q), then 
/ F-tds= [f (so - =) aa. 
JE E\0x dy 


PROOF FOR SPECIAL REGIONS. We will prove Green’s Theorem when E is a finite 
union of nonoverlapping regions each of which is of types I and II. For a proof 
of Green’s Theorem as stated, see Spivak [12]. 
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Suppose first that E is a single region of types I and II. Write the integral on 
the left in differential notation, 


[ ratow=/ Pax+ | QOdy= +h. 
aE dE aE 


We evaluate /; first. Since E is of type I, choose continuous functions f, g : 
[a, b] + Rsuch that 


E={(x,y)€R?:a<x <b, f(x) <y <g(x)}. 


Thus dE has a top y = g(x), a bottom y = f(x), and (possibly) one or two 
vertical sides (see Figure 13.16). 


Ay 
y= 8) 
t we 
a vertical side ' t 
——— ie 
on al 
t y=f@) 
> 
x 
FIGURE 13.16 


Since the positive orientation is counterclockwise, the trivial parametrization 
of the top is y = g(x), where x runs from bd to a, and of the bottom is y = f(x), 
where x runs from a to b. Since dx = 0 on any vertical curve, the contribution 
of the vertical sides to 4; is zero. Thus it follows from Definition 13.21 and the 
one-dimensional Fundamental Theorem of Calculus that 


b a 
qh = Pax =f P(x, f(x)) ax+ [ P(x, g(x)) dx 
OE a b 
b 
-j (P(x, g(x)) — P(x, f(x)) dx 


g(x) gp 
--[ | ae yydydx=— ff aa, 
f(x) E 


Since E is of type II, a similar argument establishes 


h= [ ow = ff Eas. 


[Here, we have changed parametrizations of 0 £, for example, replaced y = f(x) 
by x = f—!(y). The value of the oriented integral does not change because these 
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Ey 


Ey 


FIGURE 13.17 


parametrizations are orientation equivalent—see Exercise 13.2.6.] Adding /, 
and J» completes the proof when E is of type I and II. 

Since disjoint regions have disjoint boundaries, it remains to verify Green’s 
Theorem for two-dimensional regions which can be divided into a finite number 
of regions each of which is of types I and II (see Theorem 12.23). By induction, 
it is enough to examine a region E which can be divided into two contiguous 
regions (see Figure 13.17). Notice that although E is not of type II, the regions 
E) and E? are both of types I and Il. Applying Theorem 13.50 to each piece, 
we find 


UL (22) ff, (2-B) (2-3) 
E\ Ox dy E, \ 0x dy Ey \ Ox dy 
=f F-Tds + | F.T ds 
aE, aE2 


= F-Tds+ | F-Tds+ | F-T ds, 
aE CNIE\ CNIE2 


where C is the common border between £; and E>. Since dF; and dE are 
oriented in the counterclockwise direction, the orientation of CN dE is different 
from the orientation of C N dE. Since a change of orientation changes the sign 
of the integral, the integrals along C drop out. The end result is the integral of 
F . Tds on 0E, as promised. | 


Green’s Theorem is often used to avoid tedious parametrizations. 


13.51 EXAMPLE. 


Find , jp F-T ds, where E = [0,2] x [1,3], 0£ has the counterclockwise orien- 
tation, and F(x, y) = (xy, x* + y’). 


Solution. Since dE has four sides, direct evaluation requires four separate 
parametrizations. However, by Green’s Theorem, 
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2 3 
i F-Tas= [ [ex-nayax=4. La 
OE 0 1 


Green’s Theorem is also used to avoid difficult integrals. 


13.52 EXAMPLE. 
Find ie 7 F-Tds, where E = B\(0,0), dE has the clockwise orientation, and 
F = (xy?, arctan(log(y + 3)) — x)). 


Solution. The second component of F looks tough to integrate. However, by 
Green’s Theorem, 


/ F-Tas=— | f (—1 — 2xy) dx dy 
aE B,(0,0) 


20 1 
= i (1 + 2r? cos@ sin@)r dr d6 =n. 
0 JO 


(Note: The minus sign appears because dF is oriented in the clockwise 
direction.) a 


By Green’s Theorem, the “derivative” used to obtain a fundamental theorem 
of calculus for two-dimensional regions in R? is Q, — Py. Here are the “deriva- 
tives” which will be used when @ is a surface in R? or a three-dimensional region. 


13.53 Definition. 


Let E be a subset of R? and let F = (P, O, R) : E > R* be C! on E. The curl 
of F is 


dR 00 0P OR 0Q_ OP 
curl F = , 


dy dz’ dz dx’ dx dy 


and the divergence of F is 


Notice that if F = (P, Q,0), where P and Q are as in Green’s Theorem, then 
curl F -k = Q, — Py is the derivative used for Green’s Theorem. 
These derivatives take on a more easily remembered form by using the 


notation 
ad 0 da 
V=(—,—,-—]. 
ax Oy az 


Indeed, curl F = V x FanddivF=V.-F. 
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If E is a three-dimensional region whose topological boundary is a piecewise 
smooth orientable surface, then the positive orientation on dE is determined by 
the unit normal which points away from E°. If EF is convex, this means that n 
points outward. This is not the case, however, when E has interior “bubbles.” 
For example, if E = {x: a < ||x|| < b} for some a > 0, then n points away from 
the origin on {x : ||x|| = b} but toward the origin on {x : ||x|| = a}. 

Our next fundamental theorem applies to the case when Q is a three- 
dimensional region. This result is also called the Gauss’s Theorem. 


13.54 Theorem. [THE DIVERGENCE THEOREM]. 
Let E be a three-dimensional region whose topological boundary 0E is a piece- 
wise smooth C! surface oriented positively. If F : E > R° is C! on E, then 


[[_Fendo= [ff awrav. 


PROOF FOR SPECIAL REGIONS. We will prove the Divergence Theorem when E 
is a finite union of nonoverlapping three-dimensional regions each of which is 
of types I, II, and III. For a proof of the Divergence Theorem as stated, see 
Spivak [12]. 

Suppose first that E is a single region of types I, II, and III. Let F = (P, Q, R) 
and write the surface integral in differential form: 


II F-ndo = |[ Paydz+ ff Qazdx+ ff Rdxdy=h+ht+h. 
JE JE JE OE 


We evaluate /3 first. 
Since E is of type I, there exist a two-dimensional region B C R? and contin- 
uous functions f, g : B > R such that 


E={(x, y,z) €R?: (wy) € B, f(x,y) <z < a(x, y)}. 


Thus 0£ has a top z = g(x, y), a bottom z = f(x, y), and (possibly) a vertical 
side (see Figure 13.18). Any normal to dE on the vertical side is parallel to the 
xy-plane. Since dx dy is the third component of a normal to dE, it must be zero 
on the vertical portion. Therefore, /; can be evaluated by integrating over the 
top and bottom of 0 £. Notice that, by hypothesis, the unit normal on the bottom 
portion points downward and the unit normal on the top portion points upward. 
By using trivial parametrizations and Theorem 5.28 (the Fundamental Theorem 
of Calculus), we obtain 


b = [fax dy = [ee y. g(x,y) — R(x, y, f(x, y)) d, y) 


89) OR aR 
=| Hwy adazcday = fff Sav. 
BJ f(x,y) Oz E Oz 
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o> ea 8) 


Az 


<— vertical side 


FIGURE 13.18 


Similarly, since E is of type II, 


and since E is of type II, 


P 
n= fff av. 
E Ox 


Adding 1; + I, + J; verifies the theorem. 

Since disjoint regions have disjoint boundaries, it remains to verify the Diver- 
gence Theorem for three-dimensional regions which can be divided into a finite 
number of regions each of which is of types I, II, and III (see Theorem 12.23). 
But if E = E, U E2 share a common boundary, then 


[[[ aweav = fff avFav + [ff div FdV 
E E| E, 
= F-ndo+ [f F-ndo+ [f F -ndo, 
OE SNIE, SNdEd 


where S is the common surface between £; and E2. Since £; and Ez have 
outward-pointing normals, the orientation of $9 dF; is different from the ori- 
entation of § M dE, and the integrals over S cancel each other out. | 


The next two examples show that, like Green’s Theorem, the Divergence The- 
orem can be used to avoid difficult integrals and tedious parametrizations. 
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13.55 EXAMPLE. 


Use Theorem 13.54 to evaluate f', F-ndo, where S is the topological boundary 
of the solid E = {(x, y, z): eee y? < z < 1}, mis the outward-pointing normal, 
and F(x, y, z) = (2x + 2, x° +27, cos(x2) + sin(y>) = 2) 


Solution. Since div F = 2 — 2z, it follows from the Divergence Theorem that 


20 1 1 1 
[ [ena = | @-2)av=2f [ fa-a dzdrd@=—. 8 
S E o Jo Jr2 3 


13.56 EXAMPLE. 
Evaluate SSao¥F -ndo, where Q is the unit cube [0, 1] x [0, 1] x [0, 1], nis the 
outward-pointing normal, and F(x, y, z) = (2x — z, x*y, —xz7). 


Solution. Since 0Q has six sides, direct evaluation of this integral requires six 
separate integrals. However, by the Divergence Theorem, 


1 1 1 11 
hf F-ndo = | [ [ e+x- 200 ax ay dz= 5, | 
aa 0 Jo Jo 6 


These definitions and results take on new meaning when examined in the con- 
text of fluid flow. When F represents the flow of an incompressible fluid near a 
point a, curl F(a) measures the tendency of the fluid to swirl in a counterclock- 
wise direction about a (see Exercise 13.6.6), and div F(a) measures the tendency 
of the fluid to spread out from a (see Exercise 13.5.7). (This explains the etymol- 
ogy of the words curl and divergence.) For example, if F(x, y, z) = (x, y, z), then 
the fluid is not swirling at all, but spreading straight out from the origin. Accord- 
ingly, curl F = 0 and div F = 3. On the other hand, if G(x, y, z) = (y, —x, 0), 
then the fluid is swirling around in a circular motion about the origin. Accord- 
ingly, curl G = (0, 0, —1) but div G = 0. Note the minus sign in the component of 
curl G. This fluid swirls about the origin in a clockwise direction, so runs against 
counterclockwise motion. 

When the fluid flows over a two-dimensional region E C R’, the integral of 
F.T ds over C represents the circulation of the fluid around C in the direction of 
T (see the comments following Definition 13.21). Thus Green’s Theorem tells 
us that the circulation of a fluid around dE in the direction of the tangent is 
determined by how strongly the fluid swirls inside E. When F represents the 
flow of an incompressible fluid through a three-dimensional region E C R? and 
S = dE, the integral {/, F -n represents the flux of the fluid across the surface S$ 
(see the comments following Definition 13.43). Thus the Divergence Theorem 
tells us that the flux of the fluid across § = dE is determined by how strongly 
the fluid is spreading out inside E. 

We close this section by admitting that the interpretations of curl and di- 
vergence given above are imperfect at best. For example, the vector field 
F(x, y, z) = (0, z, 0) has curl (—1, 0, 0). Here the fluid is shearing in layers with 
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flow parallel to the xy-plane in the direction of the positive y-axis when z > 0. 
Although the fluid is not swirling, it does tend to rotate a stick placed in the fluid 
parallel to the z-axis [e.g., the line segment {(0, 1, z) : 0 < z < 1}] because more 
force is applied to the top than the bottom. This tendency toward rotation is 
reflected by the value of the curl. (Notice that the rotation is clockwise and the 
curl has a negative first component.) 


EXERCISES 


13.5.1. 


13.5.2. 


13.5.3. 


13.5.4. 


For each of the following, evaluate {.. F - T ds. 


a) C is the topological boundary of the two-dimensional region in 

the first quadrant bounded by x = 0, y = O, and y 
4 — x2, oriented in the counterclockwise direction, and F(x, y) 
(sin(/ x3 — x7), xy). 

b) C is the perimeter of the rectangle with vertices (0, 0), (2, 0), 
(0, 3), (2, 3), oriented in the counterclockwise direction, and 
F(x, y) = (e’, log(x + 1)). 

c) C = C,UCo, where C; = 0B, (0, 0) oriented in the counterclockwise 
direction, C2 = 0B>(0,0) oriented in the clockwise direction, and 
F(x, y) = (f (x? + y’), xy”), where f is aC! function on [1,2]. 


For each of the following, evaluate /( . 


a) C is the topological boundary of the rectangle [a, b] x [c, d], ori- 
ented in the counterclockwise direction, and w = (f(x) + y) dx + 
xy dy, where f : [0, 1] > R is any continuous function. 

b) C is the topological boundary of the two-dimensional region 
bounded by y = x* and y = x, oriented in the clockwise direc- 
tion, and w = yf (x) dx + (x? + y*) dy, where f : [0,1] > RisC! 
and satisfies fs xf (x) dx = ty cae deaciee 

c) C is the topological boundary of a two-dimensional region E which 
satisfies the hypotheses of Green’s Theorem, oriented positively, 
and w = e* siny dy —e* cosy dx. 


For each of the following, evaluate a ie g F-ndo, where nis the outward- 
pointing normal. 


a) Sis the topological boundary of the rectangle [0, 1] x [0, 2] x [0, 3] 
and F(x, y,z)=(*+e,y+e, e). 

b) Sis the truncated cylinder x? + y* = 1, 0 < z < 1 together with the 
disks x? + y? <1, z=0,1, and F(x, y,z) = (x2, ve 22), 

c) S is the topological boundary of E, where E C R? is bounded by 
z=2-x7,z=x7, y=0, z=y,andF(x,y,2=(@+f0,20, y+ 
g(x,z),z+h(x, y)) and f, g,h: R? > RareC!. 

d) S is the ellipsoid x*/a? + y*/b? + z7/c? = 1 and F(x, y,z) = 
(xlyl|, ylzl, zlx[)- 


For each of the following, find //,@, where n is the outward-pointing 
normal. 


13.5.5. 


13.5.6. 


13.5.7 |. 


13.5.8. 
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a) S is the topological boundary of the three-dimensional region en- 
closed by y = x*, z=0, z=1, y = 4, and@ = xyz dy dz+ (x7 + 
y? +27) dz dx +(x+y+2) dx dy. 

b) S is the truncated hyperboloid of one sheet x? — y* +27 = 1, 0< 
y < 1, together with the disks 47 gt <1, y = 0, and x Ze 2, 
y = l,and w = xy|z| dy dz + x?|z| dz dx + (x3 + y3) dx dy. 

c) S$ is the topological boundary of E, where E Cc R? is bounded 
by the surfaces x7 + y+ 27 = 4 and 4x + y4+2z = 5, and 
wo = (x+y? +27) dy dzt(x* +y 427) dz dx + (x? + y* +z) dx dy. 

a) Prove that if E is a Jordan region whose topological boundary is 


a piecewise smooth curve oriented in the counterclockwise direc- 
tion, then 


1 
Area (E) = 5 [ x dy —ydx. 
2 Jae 


b) Find the area enclosed by the loop in the Folium of Descartes; that 


is, by 
3t ar 
= | —— t : 
P(t) (aa): € [0, 00) 


c) Find an analogue of part a) for the volume of a Jordan region E 
in R?. 

d) Compute the volume of the torus with radii a > b (see Exam- 
ple 13.32). 


a) Show that Green’s Theorem does not hold if continuity of P Q is 
relaxed at one point in E. [Hint: Consider P = y/(x* + y*), Q = 
—x/(x? + y*), and E = B,(0,0).] 

b) Show that the Divergence Theorem does not hold if continuity of F 
is relaxed at one point in E. 


This exercise is used in Section 13.6. Suppose that V is a nonempty, 
open set in R* and that F: V > R? is C!. Prove that 


1 
div F(xo) = lim aE lf Penie 
r0+ Vol(B;(x0)) J JaB, (xo) 


for each xg € V, where n is the outward-pointing normal of B, (xo). 
Let F,G: R® > Rand f : R° > R be differentiable. Prove the fol- 
lowing analogues of the Sum and Product Rules for the “derivatives” 
curl and divergence. 


a) V x (F+G) =(V x F)+(V x G) 


b) V x (fF) = f(V x F) + (Vf x F) 
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c) V-(fP)=Vf-F+feV- FP) 
d) V-(F+G)=V-F+V:-G 
e) V-(Fx G)=(V x F)-G-—(V xG)-F 


This exercise is used in Section 13.6. Let E Cc R*. Recall that the 
gradient of aC! function f : E — Ris defined by 


grad f=VP=(h. fy, fz). 


a) Prove that if f is C? at xo, then curl grad f (xo) = 0. 

b) IfF : E > R'is C! on E and C? at xo € E, prove that div curl 
F(xo) = 0. 

c) Suppose that E satisfies the hypotheses of the Divergence Theorem 
and that f : E > Risa C? function which is harmonic on E (see 
Exercise 13.5.10d). If F = grad f on E, prove that 


[ff fe nao = fff ier av. 


Let E be a set in R™. For each u : E — R which has second-order 
partial derivatives on E, Laplace’s equation is defined by 


a) Show that if u is C? on E, then Au = V- (Vu) on E. 
b) [GrEEN’s First Ipentiry]. Show that if E C R> satisfies the hy- 
potheses of the Divergence Theorem, then 


) (Av + Vu--Vo av = ff uVvu-ndo 
E aE 


for all C? functions u,v: E > R. 
c) [GREEN’s SECOND IDENTITY]. Show that if E C R? satisfies the hy- 
potheses of the Divergence Theorem, then 


[ff wav—vawav = [f (uVv — vVu)-ndo 
E aE 


for all C? functions u,v: E > R. 

d) A function u : E > Ris said to be harmonic on E if and only if u is 
C? on E and Au(x) = 0 for all x € E. Suppose that E is a nonempty 
open region in R? which satisfies the hypotheses of the Divergence 
Theorem. If u is harmonic on E, u is continuous on E, and u = 0 
on JE, prove that u = 0 on E. 
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e) Suppose that V is open and nonempty in R’, that u is C? on V, and 
that u is continuous on V. Prove that uw is harmonic on V if and 
only if 


i (ux dy — uy dx) =0 
dE 


for all two-dimensional regions E C V which satisfy the hypotheses 
of Green’s Theorem. 


13.6 STOKES’S THEOREM 


Our final fundamental theorem applies to surfaces in R* whose boundaries are 
curves. 


13.57 Theorem. [STOKES’S THEOREM]. 
Let S be an oriented, piecewise smooth C* surface in R? with unit normal n. 
If the boundary 8S is a piecewise smooth C! curve oriented positively and F : 


S > R? is C!, then 
/ F-Tds= [f cul F nao. 
as Ss 


PROOF FOR SPECIAL REGIONS. We will prove Stokes’s Theorem when E is a 
finite union of nonoverlapping explicit C? surfaces which lie over “Green’s 
regions” (i.e., two-dimensional regions which satisfy the hypotheses of Green’s 
Theorem). For a proof of Stokes’s Theorem as stated, see Spivak [12]. 

Suppose first that S is a single explicit C? surface which lies over a “Green’s 
region” E. Let F = (P, Q, R) be C! on S and write the line integral in differential 
notation: 


/ F-tTas= [ Pdx+Qdy+R dz. 
as as 


Without loss of generality, suppose that S$ is determined by z = f(x,y), 
(x,y) € E, where f : E > Ris aC? function, and that S is oriented with 
the upward-pointing normal. Thus n = N/||N ||, where N = (— fx, —fy, 1). 

Let (g(t), A(t)), t € [a, b], be a piecewise smooth parametrization of 0E ori- 
ented in the counterclockwise direction. Then 


b(t) = (8, AM), f(g, h@)), t € [a, 5], 


is a piecewise smooth parametrization of dS which is oriented positively (see 
Figure 13.19). Ifx = g(t), y = h(t), and z = f(g(t),h()), then dx = 
g(t) dt, dy =h'(t) dt, and 
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z= fy) 


ue g(t) 
| 
| | 
| 
E \ (win, 00) 
FIGURE 13.19 
Thus, by definition, 
0z Oz 
Pdx+Qdy+Rdz=| (P+R—)dx+(Q+R—) dy. (10) 

as dE ax dy 


We shall apply Green’s Theorem to this last integral. By the Chain Rule and 
the Product Rule, 


OLR z 90 , dQdz_ IRAz , IR Az Az a°z 
ax dz 0x dxdy dz Ox dy dx dy 


and 


0 ‘ Oz a OF Oe ON OR OR OE, oi 82z 
dy ax) dy  azdy dydx az day dx dy Ox 


Since z = f(x, y) is C’, the mixed second-order partial derivatives above are 


equal. Therefore, 
az 0 Oz 
P+R 
ar (e+e 7) m (P+ Ras) 
2) (-3 =) (= all =) +(22 ~) 
"\ az ax dy ox dy 


Hence, it follows from (10), Green’s Theorem, and (9) that 


a 
/ F-tds= [ cutk-wac.y)= ff cu F-nao. 
0s E Ss 


Stokes’s Theorem for finite unions of pairwise disjoint explicit C? surfaces 
which lie over Green’s regions follows immediately. If the surfaces are con- 
tiguous, the common boundaries (as in the proofs of Green and Gauss) appear 
twice, each time in a different orientation and, hence, cancel each other out. Hf 


Stokes’s Theorem can be used to replace complicated line integrals by simple 
surface integrals. 
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13.58 EXAMPLE. 


Compute {..F - T ds, where C is the circle x? + z? = 1, y = 0, oriented in 
the counterclockwise direction when viewed from far out on the y-axis, and 


F(x, y,2) = (22+ Va3 + x7 +2, xy, xy + VF +22 +2). 


Solution. Since curlF = (x, x? — y, y), using Stokes’s Theorem is considerably 
easier than trying to integrate F - T ds directly. Let S be the disk x? + 2? < 1, 
y = 0, and notice that dS = C. Since C is oriented in the counterclockwise di- 
rection, the normal to S must point toward the positive y-axis [i.e., n = (0, 1, 0)]. 
Thus curl F-n = x* — y = x? on S, and Stokes’s Theorem implies 


Qn 1 1 
[eves ff ear [ | Pe0s?@ dr a0 = ©. | 
fal Ss 0 JO 4 


In Example 13.58, we could have chosen any surface S whose boundary is C. 
Thus Stokes’s Theorem can also be used to replace complicated surface integrals 
by simpler ones. 

13.59 EXAMPLE. 


Find Phe curl F -ndo, where S is the semiellipsoid 9x? + 4y? + 3627 = 36, z > 0, 
nis the upward-pointing normal, and 


F(x, y, z) = (cosx sinz + xy, x°, er te _ eye? + tan(xy)). 


Solution. Let C = 0S. The integral of curl F - n do over S and the integral of 
F . Tds over C are both complicated. But, by Stokes’s Theorem, the integral of 
F - Tds over C is the same as the integral of curlF - ndo over any oriented C? 
surface E satisfying 9E = C. Let E be the two-dimensional region 9x* + 4y? 
< 36. On E, n= (0,0, 1). Thus we only need the third component of curl F: 


a a 
(curl F)3 := —= (£3) — —(cosx sinz+xy) = 3x7 — x. 
ax dy 


Therefore, 


II curl F -ndo = f o2-» d(x, y). 
S E 


Let x = 2rcos6 and y = 3rsin@. By a change of variables, 


20 1 
/ (3x? — x) d(x, y) = / / (12r? cos? @ — 2r cos@)6r dr d0 = 187. 
E 0 0 


Stokes’s Theorem can also be used to replace complicated surface integrals 
by simple line integrals. 
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13.60 EXAMPLE. 


Let S be the union of the truncated paraboloid z = x7 + y?, 0 < z < 1, and the 
truncated cylinder xo ae =1, 1<z<3. Compute 


[[ ena. 
S 


where n is the outward-pointing normal and F(x, y, z) = (x + z*,0, —z — 3). 


Solution. The boundary of S is x? + y* = 1, z = 3. To use Stokes’s Theorem, 
we must find a function G = (P, QO, R) : S > R° such that curlG = F; that is, 
such that 


dR dQ > 
ee ea : 11 
dy Oz ee 
dP aR 
= 0, 12 
eae (12) 
and 
dQ OP 
= 3: 13 
ax dy ‘i (13) 
Starting with (11), set 
a aR 
ay fads oe (14) 
Oz dy 


The left side of (14) implies Q = —xz+ g(x, y) for some g : R? > R. Similarly, 
the right side of (14) leads to R = z?y + h(x, z) for some h : R* — R. Thus 
QO, = —z+ 8, will solve (13) if we set g = 0 and P, = 3; that is, P = 3y +o(x, z) 
for some o : R? - R. Hence, P, — Ry = 0; — hx will satisfy (12) ifo =h = 0. 
Therefore, P = 3y, Q = —xz and R = yz’; that is, G = (3y, —xz, yz’). 
Parametrize 0S by @(t) = (sint, cost, 3), t € [0, 27], and observe that 


(God): ¢' = Bcost, —3sint, 9cos fr) - (cost, — sint, 0) = 3cos*t +3 sin? ¢ = 3. 


Consequently, Stokes’s Theorem implies 


2n 
[[¥ nae = [fowie nao = [ G-Tas= | 3 dt = 6x. | 
S S as 0 


The solution to Example 13.60 involved finding a function G which satisfied 
curlG = F. This function is not unique. Indeed, we could have begun with 


a R 
ag =-z* and oe =x 
dz dy 
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instead of (14). This leads to a different solution: 


G(x, y, 2) = (zy, —Bx +. 27/3), xy). 


The technique used to solve Example 13.60, however, is perfectly valid. Indeed, 
by Stokes’s Theorem the value of the oriented line integral of G - T will be the 
same for all C! functions G which satisfy curl G = F. 

This technique works only when the system of partial differential equations 
curlG = F has a solution G. To avoid searching for something which does not 
exist, we must be able to discern beforehand whether such a solution exists. To 
discover how to do this, suppose that G is a C? function which satisfies curlG = F 
on some set E. Then divF = 0 on E by Exercise 13.5.9b. Thus the condition 
div F = 0 is necessary for existence of a solution G to curlG = F. The following 
result shows that if EF is nice enough, this condition is also sufficient. 


13.61 Theorem. Let Q be a ball or a rectangle with nonempty interior, and sup- 
pose that F : 2 — R? is C! on Q. Then the following three statements are equiva- 
lent. 


i) There is aC? function G : Q > R° such that curlG = F on Q. 
ii) If Eand S = 0E satisfy the hypotheses of the Divergence Theorem and E Cc 


Q, then 
[[# nao =o (15) 
S 


iii) The identity div F = 0 holds everywhere on Q. 


Proof. If i) holds, then div F = div (curl G) = 0 since the first-order partial 
derivatives of G commute. Thus (15) holds by the Divergence Theorem. (This 
works for any set Q.) 

If ii) holds, then by the Divergence Theorem and Exercise 13.5.7, 


1 
lim ome I ff div FdV 
r—>0+ Vol (B; (x0)) r (Xo) 


1 
Sik iaedaes 
r>0+ Vol (B;(X0)) J JaB, (x) 


for each xp € Q°. Since div F is continuous on Q, it follows that div F = 0 
everywhere on Q. (This works for any three-dimensional region Q.) 

Finally, suppose that iii) holds. Let F = (p, q,r) and suppose for simplicity 
that G = (0, Q, R). If curlG = F, then 


div F(xo) 


Ry—Q,;=p, —R,=4q, pe (16) 


If Q is a ball, let (xo, yo, zo) be its center; if Q is a rectangle, let (xo, yo, zo) be 
any point in Q. Then given any (x, y, z) € Q, the line segment from (x9, y, z) 
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to (x, y, z) is a subset of Q. Hence we can integrate the last two identities in 
(16) from xo to x, obtaining 


x x 
R= -| qu, y,z)du+g(y,z) and Q = r(u, y,z) du + h(y, z) 
x x0 


0 


for some g,h : R* -— R._ Differentiating under the integral sign 
(Theorem 11.5), and using condition iii), the first identity becomes 


x 
p=Ry-Q,= -| (qy(, y,z) +rz(u, y, z)) du + gy —hz 
x0 


x 
=i Px, y,Z) du + gy —hz = p(x, y, Zz) — p(x, y, Z) + By — hz. 
x 


0 


Thus (16) can be solved by gy = p(xo, y, z) and h = 0; that is, 


x y x 
o- | r(u,y,z)du and r= | P(x0, V, Z) av | qtu,y,z)du. 
x0 x 


YO 0 


We notice that Theorem 13.61 holds for any three-dimensional region Q 
which satisfies the following property: There is a point (x0, yo, zo) € such 
that the line segments L((x0, y, z); (x, y, z)) and L((x0, yo, Z); (xo, y, Z)) are both 
subsets of Q for all (x, y,z) € &. However, as the following result shows, 
Theorem 13.61 is false without some restriction on Q. 


13.62 Remark. Let Q = B,(0, 0, 0)\{(0, 0, 0)} and 


x y Zz 
F(x, y,z = ( ; ; \ 
(X,Y, Z) pare? pale apes 


where w = w(x, y,z) = x° + y? +27. Then div F = 0 0n Q, but there is no G 
which satisfies curlG = F. 


Proof. By definition, 


—2x2 2 2 2 _ Jy2 2 2 2_ 972 
Cy ema as ee eG 
wile moe ee 


Let S represent the unit sphere 0B,(0,0,0) oriented with the outward- 
pointing normal, and suppose that there is a G such that curlG = F. On 
the one hand, since F = (x, y,z) = non S implies F-n = ve+y+7=1, 


we have 
[ [endo = [fda sors) = 4. (17) 
S S 
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On the other hand, dividing S$ into the upper hemisphere S; and the lower 
hemisphere S2, we have by Stokes’s Theorem that 


[fe ado = [fv nde + [ff P-mdo ae 


= G-Tas+ [ G-To ds =0. 
0S) 


0S2 
This last step follows from the fact that 0S; = dS. and Tj; = —T>. Since 
(17) and (18) are incompatible, we conclude that there is no G which satisfies 
curlG = F. a 
EXERCISES 


13.6.1. For each of the following, evaluate {. F - T ds. 


a) C is the curve formed by intersecting the cylinder x? + y* = 1 with 
z = —x, oriented in the counterclockwise direction when viewed 
from high on the positive z-axis, and F(x, y, z) = (xy?, 0, xyz). 

b) C is the intersection of the cubic cylinder z = y° and the cir- 
cular cylinder x? + y* = 3, oriented in the clockwise direction 
when viewed from high up the positive z-axis, and F(x, y,z) = 
(e* +z, xy, ze). 


13.6.2. For each of the following, evaluate ff, curl F -ndo. 


a) Sis the “bottomless” surface in the upper half-space z > 0 bounded 
by y = x*, z = 1 — y, mis the outward-pointing normal, and 
Fa,y,2=( sin z?, y COs 2, xo+ ye + 2). 

b) Sis the truncated paraboloid z = 3—x?—y’, z > 0, mis the outward- 
pointing normal, and F(x, y, z) = (y, xyz, y). 

c) S is the hemisphere z = 10 —x? — y?, n is the inward-pointing 
normal, and F(x, y, z) = (x, x, x7y? log(z + 1)). 

d) S is the “bottomless” tetrahedron in the upper half-space z > 0 
bounded by x = 0, y=0, x +2y+3z = 1, z > 0, mis the outward- 
pointing normal, and F(x, y, z) = (xy, yz, xz). 


13.6.3. For each of the following, evaluate /'/, F -ndo using Stokes’s Theorem 
or the Divergence Theorem. 


a) S is the sphere x? + y? + z? = 1, nis the outward-pointing normal, 
and F(x, y, z) = (xz, x*y — z3, xy + y?z). 

b) Sis the portion of the plane z = y which lies inside the ball B; (0), n 
is the upward-pointing normal, and F(x, y, z) = (xy, xz, —yz). 

c) Sis the truncated cone y = 2V x2 + z2, 2 < y < 4, mis the outward- 
pointing normal, and F(x, y, z) = (x, —2y, z). 

d) Sisa union of truncated paraboloids z = 4 a y’, 0<z<4,and 
Zaxet y? —4, —4<z <0,mis the outward-pointing normal, and 


Fa,y,z2=(& pay + sin z, x spay + cos z, cosx + siny + Zz). 
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13.6.4. 


13.6.5. 
13.6.6. 


13.6.7. 


13.6.8. 
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e) S is the union of three surfaces z = x y? (0 <z< 2),2= 
x* + y* (2 < z < 5), andz = 7—x* — y* (5 < z < 6), nis the 
outward-pointing normal, and F(x, y, z) = (2y, 2z, 1). 


For each of the following, evaluate /, w using Stokes’s Theorem or the 
Divergence Theorem. 


a) S is the topological boundary of cylindrical solid y? + 2? < 9, 0 < 
x < 2, with outward-pointing normal, and wm = xydydz + (x? — 
2?) dzdx +.xzdx dy. 

b) S is the truncated cylinder x74+77=8,0< y < 1, with outward- 
pointing normal, and w = (x — 2z)dydz — ydzdx. 

c) Sis the topological boundary of R = [0, 7/2] x [0, 1] x [0, 3], with 
outward-pointing normal, and w = e) cosx dydz+x?zdzdx +(x + 
y+z)dxdy. 

d) S is the intersection of the elliptic cylindrical solid 2x7 + 27 < 1 
and the plane x = y, with normal which points toward the positive 
x-axis, and w = xdydz — ydzdx + sinydx dy. 


Prove that Green’s Theorem is a corollary of Stokes’s Theorem. 

Let IT be a plane in R? with unit normal n and xp € J7. For eachr > 0, 
let S, be the disk in IT centered at xo of radius r [ie., S, = B-(x9) N JT]. 
Prove that if F : B,(xo) — R®? is C! and dS, carries the orientation 
induced by n, then 


curl F(xp)-n= lim F-.Tds. 


r>0+ 0 (S;) Jas, 


Let S be an orientable surface with unit normal n and nonempty bound- 
ary 0S which satisfies the hypotheses of Stokes’s Theorem. 


a) Suppose that F : § —> R*\{0} is C!, that 4S is smooth, and that T 
is the unit tangent vector on 0S induced by n. If the angle between 
T(x) and F(xo) is never obtuse for any x9 € 4S, and ff, curlF - 
ndo = 0, prove that T(xp) and F(xo) are orthogonal for all xg € dS. 

b) IfF, F, : S > R? are C! and F;, — F uniformly on 4S, prove that 


lim II curly -ndo = [[f curlF -ndo. 
k->oo S Ss 


Suppose that E is a two-dimensional region such that if (x, y) € E, 
then the line segments from (0, 0) to (x, 0) and from (x, 0) to (x, y) are 
both subsets of E. If F : E — R? is C!, prove that the following three 
statements are equivalent. 


a) F=Vf on E forsome f: E> R. 

b) F = (P, Q) is exact (i.e., Qy = Py on E). 

c) {.F-Tds = 0 for all piecewise smooth curves C = dQ oriented 
counterclockwise, where Q is a two-dimensional region which satis- 
fies the hypotheses of Green’s Theorem, and Q c E. 


13.6.9. 


13.6.10. 


13.6.11. 
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Let Q be a three-dimensional region and F : Q > R? be C! on Q. 
Suppose further that, for each (x, y,z) € , both the line segments 
L((x, y, 0); (x, y, z)) and L((x, 0, 0); (x, y, 0)) are subsets of Q. Prove 
that the following statements are equivalent. 


a) There is a C? function G : Q — R? such that curlG = F on Q. 
b) If F, E, and S = dE satisfy the hypotheses of the Divergence Theo- 


rem and E c Q, then 
[[# nao =o 
Ss 


c) The identity div F = 0 holds everywhere on Q. 


Suppose that E satisfies the hypotheses of the Divergence Theorem and 
that S satisfies the hypotheses of Stokes’s Theorem. 


a) If f : S > RisaC? function and F = grad f on S, prove that 


I (fF) -T ds =0. 
os 


b) IfG: E > R° is aC? function and F = curlG on E, prove that 


[crm nde = fff ead pray. 


Note: You may wish to use Exercises 13.5.8 and 13.5.9. 
Let F be C! and exact on R?\{(0, 0)} (see Exercise 13.6.8b). 


a) Suppose that C; and C} are disjoint smooth simple curves, oriented 
in the counterclockwise direction, and that E is a two-dimensional 
region whose topological boundary 0£ is the union of C; and C2. 
(Note: This means that E has a hole with one of the C;’s as the 
outer boundary and the other as the inner boundary.) If (0,0) ¢ E, 


prove that 
[Pe -tas= F.Tds. 
Ci C2 


Suppose that E is a two-dimensional region which satisfies 
(0,0) € E°. If JE is a smooth simple curve oriented in the coun- 
terclockwise direction, and 


—y x 
F(x, y) = (5. =) ’ 


compute f,,,F-Tds. 
State and prove an analogue of part a) for functions F : R°\{(0, 
0, 0)}, three-dimensional regions, and smooth surfaces. 


b 


wa 


c 


wa 


CHAPTER 14 


Fourier Series 


*14.1 


INTRODUCTION 


This section uses no material from any other enrichment section. 
In Chapter 7 we studied power series and their partial sums, classical polyno- 
mials. In this chapter, we shall study the following objects. 


14.1 Definition. 
Let az, by € Rand let N be a nonnegative integer. 
i) A trigonometric series is a series of the form 


ao 


CO 
5 + Soa coskx + by sinkx). 


k=1 
ii) A trigonometric polynomial of order N is a function P : R > R of the form 
ao al 
P(x)= Pi + 2 (01 cos kx + by sinkx). 


[Here, cos kx is shorthand for cos(kx), and sinkx is shorthand for sin(kx).] 


Calculus was invented with the tacit assumption that power series provided 
a unified function theory; that is, every function has a power series expan- 
sion (see Kline [5]). When Cauchy showed that this assumption was false (see 
Remark 7.41), mathematicians began to wonder whether some other type of 
series would provide a unified function theory. Euler (respectively, Fourier) had 
shown that the position of a vibrating string (respectively, the temperature along 
a metal rod) can be represented by trigonometric series. Thus, it was natural to 
ask, Does every function have a trigonometric series expansion? In this chap- 
ter we shall examine this question, and the following calculation will help to 
answer it. 


598 
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14.2 Lemma. [ORTHOGONALITY]. 
Let k, j be nonnegative integers. Then 


20 k=j=0 


18 


i) coskx cos jxdx = 47 k=j 40 
= 0 k#j 
4 k —j] 
ii) sinkx sin jxdx = Mi J a 
a 0 kAj 
and 
4 
iii) sinkx cos jxdx = 0. 
—1 
Proof. Let 


a 
I =| cos kx cos jx dx. 


—T 


Ifk = j =0,then/ = f" dx = 2x. Ifk = j 4 0, then by a half-angle 
formula and elementary integration, we have 


wT 1 Tw 
I = cos* kx dx = >| (1 + cos2kx) dx =a. 
TT 


—I ae 


And if k # j, then by a sum-angle formula and elementary integration, we 
have 


1 us 
I= >| (cos(k + j)x + cos(k — j)x)dx = 0. 
—1 


This proves part i). Similar arguments prove parts ii) and iii). | 


Notice that the question concerning representation of functions by trigono- 
metric series has a built-in limitation. A function f : R — R is said to be 
periodic (of period 27) if and only if f(x + 27) = f(x) for all x € R. Since 
coskx and sinkx are periodic, it is clear that every trigonometric polynomial 
is periodic. Therefore, any function which is the pointwise or uniform limit of 
a trigonometric series must also be periodic. For this reason, we will usually 
restrict our attention to the interval [—z, 7] and assume that f(—7z) = f(z). 

The following definition, which introduces a special type of trigonometric 
series, plays a crucial role in the representation of periodic functions by trigono- 
metric series. 
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14.3 Definition. 
Let f be integrable on [—z, z] and let N be a nonnegative integer. 
i) The Fourier coefficients of f are the numbers 


4 


1 
an(f) = — f (x) coskx dx, KOs dha 
—T 


and 
1 ie 
be(f) = - f(x) sinkx dx, KeSeb 2 td 
W 


SIG 


ii) The Fourier series of f is the trigonometric series 


ago(f) 


aa YS (ax(f) cos kx + by(f) sin kx). 


k=1 


(Sf) = 


iti) The partial sum of Sf of order N is the trigonometric polynomial defined, 
for each x € R, by (So f)(x) = ao(f)/2 if N = 0, and 


N 
+ SV (ax(f) cos kx + by(f) sinkx) 
k=1 


(Sw f)(x) = 


ao(f) 
2 


ifN EN. 


The following result shows why Fourier series play such an important role in 
the representation of periodic functions by trigonometric series. 


14.4 Theorem. [FOURIER]. 
If a trigonometric series 


CO 
ao : 
S:= — ) k. b k 
5 ge es x + by sinkx) 


converges uniformly on R to a function f, then S is the Fourier series of f; that 
is, ay = ag(f) fork =0,1,..., and by = by(f) fork = 1,2,.... 


Proof. Fix an integer k > 0. Since 


Co 
a sen ds 
faa tt ) (aj cos jx + b; sin jx) 
j=l 
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converges uniformly and cos kx is bounded, 


CO 
f(x) coskx = > coskx + S (aj cos jx coskx + bj; sin jx cos kx) (1) 
j=l 


also converges uniformly. Since f is the uniform limit of continuous functions, 
f is continuous and, hence, integrable on [—z,z]. Integrating (1) term by 
term and using orthogonality, we obtain 


Tv 


an(f) = f (x) coskx dx 


—T 


ao T 0° a; 4 
= oo f coskx dx + ) al cos kx cos jx dx 
20 Jz oo NH Jz 


b; cia 
dived. cos kx sin jx ax) 
UW J—n 


= dk. 
A similar argument establishes by (f) = bx. | 


There are two central questions in the study of trigonometric series. 


THE CONVERGENCE QUESTION. Given a function f : R > R, periodic on R and 
integrable on [—1, 1], does the Fourier series of f converge to f? 


THE UNIQUENESS QUESTION. [fa trigonometric series S converges to some func- 
tion f integrable on |—7, x], is S the Fourier series of f? 


We shall answer these questions for pointwise and uniform convergence 
when f is continuous and of bounded variation. We notice in passing that, by 
Theorem 14.4, the answer to the Uniqueness Question is yes if uniform conver- 
gence is used. 

The following special trigonometric polynomials arise naturally in connection 
with the Convergence Question (see Exercise 14.1.2). 


14.5 Definition. 
Let N be a nonnegative integer. 


i) The Dirichlet kernel of order N is the function defined, for each x € R, by 
Do(x) = 1/2 if N = 0, and 


N 
1 
Dy(x) = 5 + Y > cos kx 
k=1 


if N EN. 
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14.5 Definition. (Continued) 


ii) The Fejér kernel of order N is the function defined, for each x € R, by 
Ko(x) = 1/2 if N =0, and 


Ky(x) = 


Nile 


+3 (1 Ay) cont @) 


k=1 


if N EN. 


The following result shows that there is a simple relationship between Fejér 
kernels and Dirichlet kernels. 


14.6 Remark. /f N is a nonnegative integer, then 


D ees D 
in= wa N(x) 


forallx ER. 


Proof. The identity is trivial if N = 0. To prove the identity for N € N, fix 
x ER. By definition, 


N 
Lf 
Ky(x) = Eel (a +N -k+ b sok 


k=1 

1 1 WN 
“NW 2 cae cos kx 

k=1 j=k 

oe en 6 oe Do(x) +++» + Dy(x) 
=... = = k = 

Wl o> ae Nel 

j= ——% 


The next result shows that Dirichlet and Fejér kernels can be represented by 
quotients of trigonometric functions. 


14.7 Theorem. [fx € R cannot be written in the form 2kx for any k € Z, then 


sin (w + 3) x 
Dy (x) = Sane = (3) 
2 
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and 
2 
sin (44) x 


K = 
n(x) N+1 2 sin 5 


(4) 


forN =0,1,.... 


Proof. The formulas are trivial for NV = 0. Fix N € N. Applying a sum-angle 
formula and telescoping, we have 


1 
(Dw) _ 5) sin 5 = Y “cos kx sin 5 


Solving this equation for Dy (x) verifies (3). 
Let k EN. By (3) and another sum-angle formula, 


2% D6 9s 1 1 
Dx(x) sin cia ae ae kts x = 7 (cos kx — cosh + 1x). 


This identity also holds for k = 0. Applying Remark 14.6 and telescoping, we 
have 


(N + 1) Ky (x) sin? 5 = S~ Dy(x) sin? 5 


N 
1 
=a Y “(cos kx — cos(k + 1)x) 


k=0 
1 1 N+1 
= 7(l —cos(W + Dx) = 5 sin? (=) x 
Solving this equation for Ky (x) verifies (4). | 


These identities will be used in the next section to obtain a partial answer to 
the Convergence Question. 

The next two examples illustrate the general principle that the Fourier coeffi- 
cients of many common functions can be computed using integration by parts. 
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14.8 EXAMPLE. 


Prove that the Fourier series of f(x) = x is 


oO ei 
25— k sinkx. 
k=1 


Proof. Since x coskx is odd and x sinkx is even, we see that a,(f) = 0 for 
k=0,1,..., and 


wo TT 
nif == f x sinkx dx 
0 


fork = 1,2,.... Integrating by parts, we conclude that 


2( xcoskx », 11 {7 ery 
bi(f) = = i Fs Te; : cos kx dx a | 


14.9 EXAMPLE. 


Prove that the Fourier series of f(x) = |x| is 


X 25, ey 
Ce com (2k — 1)? 


Proof. Since |x| cos kx is even and |x|sinkx is odd, we see that b,(f) = 0 for 
k=1,2..., and 


2 a 
aif == f x coskx dx 
wT JO 
fork =0,1,....Ifk =0, then 


2 2 
an(f) = — (=) =1; 


that is, ag( f)/2 = 1/2. Ifk > 0, then integration by parts yields 


2 1) if k is even, 
Be I) = apy Osa d) e4 if k is odd. 7 
Tw 


EXERCISES 


14.1.1. Compute the Fourier series of x? and of cos? x. 
14.1.2. Prove that if f : R > Ris integrable on [—7, zr], then 


Tv 


1 
(Sw f(x) = = f@)Dn@ — t)dt 


—T 


for allx € [-z,a] and N EN. 
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14.1.3. Show that if f, g are integrable on [—z, m7] anda € R, then 


a(f +g) =al(f)+ar(g), alaf) =aa(f), k=0,1,..., 


and 


b(f +g) =bil(f) + ble), belaf) =abk(f), ree 


14.1.4. Suppose that f : R — R is differentiable and periodic and that f’ is 
integrable on [—z, wz]. Prove that 


ax(f’) =kby(f) and dy (f!) = —kax(f), keEN. 
14.1.5. Suppose that fy : [-z,a] — R are integrable and that fy — f uni- 
formly on [—z, a] as N > o. 


a) Prove that ax(fy) — ag(f) and by( fy) > bg(f), as N — oo, uni- 
formly in k. 
b) Show that part a) holds under the weaker hypothesis 


dim, [7 fe) — fv()lax =0. 


14.1.6. Let 


x 


Aydin Or" 
0 


x=0. 


a) Compute the Fourier coefficients of f. 
b) Prove that 


2 (* sin2Nt 
(Say f(a) = = | eNO, 
x Jo  sint 


forx €[-7z,a]andN EN. 
*c) [GIBBS’S PHENOMENON]. Prove that 


lim (Soy f)( 7 ) z [ Sin ae 2 1,179 
1m — ~ 1. . 
N->oo se 2N mw Jo t 


*14.2. SUMMABILITY OF FOURIER SERIES 


This section uses material from Section 14.1. 

The Convergence Question posed in Section 14.1 is very difficult to answer, 
even for continuous functions. In this section we replace it with an easier ques- 
tion and show that the answer to this question is yes. Namely, we shall show that 
the Fourier series of any continuous periodic function f is uniformly summable 
to f. By summable, we mean the following concept. 
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14.10 Definition. 


A series )°¢29 ax with partial sums sy = Yip ax is said to be Cesaro 
summable to L if and only if its Cesdro means 


rs _ SOT ++ TSN 
ae 


converge to L as N — ow. 


The following result shows that summability is a generalization of conver- 
gence. 


14.11 Remark. /f )°7°.) ax converges to a finite number L, then it is Cesdro 
summable to L. 


Proof. Let « > 0. Choose N; € N such that k > N, implies |s, — L| < ¢/2. 
Use the Archimedean Principle to choose Nz € N such that Nz > N; and 


mn EN? 
k=0 
If N > No, then 
I N l N 
lov -L| < Yo lse — LI] + > Ix -L 
Nerd Ty N+ 1 yd 
ts EN) see N-WN, Paake 
< =e. 
NET. 2 NEI ae a 


The converse of Remark 14.11 is false. Indeed, although the series )7?°.9(— 1* 
does not converge, its Cesaro means satisfy 


N+2 


—___—_ N is even 
2(N + 1) 

1 

= N is odd, 

2 


whence oy — 1/2 as N > oo. 

It is easier to show that a series is Cesaro summable than to show that it 
converges. Thus the following question is easier to answer than the Convergence 
Question. 


THE SUMMABILITY QUESTION. Given a function f : R > R, periodic on R and 
integrable on [—1, 1], is Sf Cesdro summable to f? 


The Cesaro means of a Fourier series Sf are denoted by 


s Aaaks 
(on f)(x) = (SoM @) a NPV) 
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N = 0,1,.... The following result shows that the Cesaro means of a Fourier 
series can always be represented by an integral equation. This is important 
because it allows us to estimate the remainder oy f — f, using techniques of 
integration. 


14.12 Lemma. 
Let f :R > R be periodic on R and integrable on [—1, x]. Then 


18 


1 
(on f)(X) = = f(x —t)Ky(t) dt 


forall N =0,1,..., andallx ER. 
Proof. Fix j, N €¢ Nand x € R. By definition and a sum-angle formula, 


aj(f) cos jx + bj (f) sin jx 
as 1 1 
=e f(u) cos jucos jx du + — fu) sin ju sin jx du 


ea 
=— f @)(cos jucos jx + sin ju sin jx) du 
—1 


euie ” RGN ees Ge aivane 
WW J—n 


Summing this identity over integers j = 1, 2,...,k and adding ag(f)/2, we 
have 


k 
(Sef)@) = wots Sai (f) 00s fx + bi(f)sin jx) 
j=l 
17 le 
=— - f@) 5 oo ene —u)]| du 


ae f(u) D(x — u) du 
uw 


—T 


for k = 0, 1,.... Making the change of variables t = x — u and using the fact 
that both f and D, are periodic, we obtain 


Tv 


1 
(SKA) =— f  fe-)De(dt, — k=0,1,.... 


—T 
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We conclude by Remark 14.11 that 


N 
1 
(ow @) =F LAO 
1 a 


= == an i f(x —1t)D,(t) dt = Ee f(x —t)Kn(t) dt 
Ne la —1 WT Jn , 


To answer the Summability Question we need to know more about Fejér 
kernels. The following result shows that Fejér kernels satisfy some very nice 
properties. 


14.13 Lemma. 
For each nonnegative integer N, 


Kn(t)>0 forallt eR, (5) 
and 


-[ Ky(t)dt =1. (6) 
Jn 


Moreover, for each0 <é <a, 
a 
lim / [Kw (t)| dt = 0. (7) 
N>o J5 


Proof. Fix N > 0. Ift =2j for some j € Z, then Dy(t) =k + 1/2 = 0 for all 
k > 0, whence Ky (t) > 0. Ift 4 2j for any j € Z, then, by Theorem 14.7, 


sin (4): 
> 0. 


N+1 2 sin 5 = 


Ky(t) = 


This proves (5). By Definition 14.5 and orthogonality, 


[ kwoa=[- (; »(1 wz) sos) dt =n. 


k=1 


This proves (6). 
To prove (7), fix 0 <6 < 7 and observe that sint/2 > siné/2 fort € [6, z]. 
Hence, it follows from Theorem 14.7 that 


2 
1 2 x {sin (SH) « X 
if |Ky(t)|dt < ip dt < 
3 3 


~N+1 2sin $ ~ 2(N + 1) sin? 3° 


Since 6 is fixed, this last expression tends to 0 as N > oo. a 
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Using these properties, we can answer the Summability Question for contin- 
uous functions (see also Exercises 14.2.6 and 14.2.8). 


14.14 Theorem. [FEJER]. 
Suppose that f :R — Ris periodic on R and integrable on [—7, 7]. 


i) If 


._ fG@o+h) + f(xo—h) 
L= lim 
h>0 2 
exists for some xo € R, then (on f)(xo) > Las N > oo. 
ii) If fis continuous on some closed interval I, then oy f > f uniformly on I 
as N > ~. 


Proof. Since f is periodic, we may suppose that x9 € [—z, 7]. Fix N € N. By 
(6), Lemma 14.12, and a change of variables, 


Tv 


1 
(on f)(%0) —L = - | Kn(t)(f (xo — t) — L) dt 


—t 


= [kuin (Oto L) dt (8) 
wT JO 2 


== Ky (t)F (xo, t) dt. 
uw JO 


Let e > 0 and choose 0 < 6 < w such that |r| < 6 implies |F (x0, t)| < ¢/3. 
By (5) and (6) we have 


2 f° 2% 2 
- | Ky(t)|F (ao, t)|dt < =| Ky(t)at < =. (9) 
wT JO 3x Jo 3 


On the other hand, choose by (7) an Nj € N such that N > AN, implies 
JS, Kn(t) dt < €/3M, where M := sup,cr|F(x)|. Then 


2 (7 a € 
- | Ky(O|F (x0, t)| dt em | Kyide 2, 
Ss 5 3 


and it follows from (8) and (9) that 


2 2 ¢% 
chen ue = [ Kw(DIFGo. Nldt+= f Ky(t)|F (ao, t)|dt <e (10) 


for all N > N,. This proves part i). 

To prove part ii), suppose that f is continuous on some closed interval /. 
Since f is periodic, we may suppose that J C [—z,z]. Thus / is closed and 
bounded, and f is uniformly continuous on J. Repeating the estimates above, 
we see that (10) holds uniformly for all xo € J. a 
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14.15 Corollary. If f : R — Ris continuous and periodic, then oy f converges 
to f uniformly on Ras N > om. 


14.16 Corollary. [COMPLETENESS]. Jf f : R — Ris continuous and periodic, 
and ax_\(f) = by(f) = 0 fork EN, then f(x) =0 forallx ER. 


Proof. By hypothesis, (oy f)(x) = 0 for all N € N and x e€ R. Hence, by 


Corollary 14.15, f(x) = limy+(on f)(x) = 0 for all x ER. a 
14.17 Corollary. Let f : R — R be continuous and periodic. Then there is a 
sequence of trigonometric polynomials T,, Tr,..., such that Ty —> f uniformly 
onR. 

Proof. Set Ty = on f for N €N, and apply Corollary 14.15. a 
EXERCISES 


14.2.1. Let E C R and suppose that f, f, : R — R are bounded functions. 
Prove that if °°. 9 fe(x) converges to f(x) uniformly on E, then 


N 


on(x):= >> (1- a wot) fi(x) 


k=0 


converges to f(x) uniformly on E as N > oo. 

14.2.2. If f : R > Ris periodic on R and integrable on [—7, zr], prove that the 
Cesaro means of Sf are uniformly bounded; that is, there is an M > 0 
such that 


(on f)(x)| = M 


forallx €RandN EN. 
14.2.3. Let 


CO 
ie S + 2 (or cos kx + by sinkx) 


be a trigonometric series and set 


On(X) = 4y(1- Hp) ae cos + by sin 


for x € Rand N EN. Prove that S is the Fourier series of some contin- 
uous periodic function f : R > Rif and only if oy converges uniformly 
on R, as N > o. 

14.2.4. Let f be integrable on [—z, 7] and L ER. 


a) Prove that if (oy f)(xo) > Las N — oo and if (Sf)(xo) converges, 
then (Sy f)(xo) > L. 
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b) Prove that 


[e,2) 


4(—1)* sin V2 
sin V2 + rae cos kx 


converges to /27 cos /2x uniformly on compact subsets of (0, 27). 


14.2.5. Suppose that f : [a,b] — Ris continuous and that 


b 
/ x" f(x) dx =0 


for all integers n > 0. 


a) Evaluate / v P(x) f (x) dx for any polynomial P on R. 


b) Prove that fe | f(x)|? dx =0. 
c) Show that f(x) = 0 for all x ¢€ [a, D]. 


14.2.6. [SUMMABILITY KERNELS]. Let dy : R > R be a sequence of continuous, 
periodic functions on R which satisfy 


20 20 
én(t)dt =1 and lpn (t)| dt < M <0o 
0 


for all N < N, and 


2n—65 
tim f lon (t)|dt =0 
N>-o J5 


for each 0 < 5 < 2m. Suppose that f : R — R is continuous and 
periodic. Prove that 


20 


lim ff —Déw()dr = FO) 


N->oo 


uniformly for x € R. 
14.2.7. Let [a, b] be a nondegenerate, closed, bounded interval. 


a) Prove that given any polynomial P on R and any « > 0, there is 
a polynomial Q on R, with rational coefficients, such that | P(x) — 
QO(x)| < € for all x € [a, D]. 

*b) Prove that the space C[a, b] (see Example 10.6) is separable. 


*14.2.8. A sequence of functions fy : R > R is said to converge almost every- 
where to a function f if and only if there is a set E of measure zero such 
that fy(x) > f(x), as N > om, for every x € R \ E. Suppose that 
f : R= Ris also periodic. Prove that if f is Riemann integrable on 
[—z, 7], then oy f > f almost everywhere as N > oo. 
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*14.3. GROWTH OF FOURIER COEFFICIENTS 


This section uses material from Sections 5.5 and 14.2. 

By Theorem 14.14, a continuous periodic function f is completely determined 
by its Fourier coefficients. In this section we ask to what extent smoothness of f 
affects the growth of these coefficients. 

We begin with a computational result. 


14.18 Lemma. 
If f :R > Ris integrable on |—x, 1] and N is a nonnegative integer, then 


eles 


N 
1 
— [" FONSy Nanas + > (la Ar? + eA?) 
ie k=l (11) 


1 ca 
= Lf (Sw OOP dx. 
WT Jn 


Proof. Fix N => 0. Since f and Sy f are integrable on [—z, x], both integrals 
in (11) exist. By definition and orthogonality, 


1 a od 
t "Fos OD ay _ lao(f)| = "Sy fee) oO 
a4 2 4 


Similarly, 


1 w 1 Ht 
— | flx)ai(f) coske dx = lax NIP = — / (Sw f)(x)ag( f) cos kx dx 
and 


1 7 1 7 
— | f(x)be(f)sinkx dx = DP = — J (Sw AY)buCA) sinks dx 


—1 
for k € N. Adding these identities for k = 0,..., N verifies (11). a 


Next, we use this result to identify a growth condition satisfied by the Fourier 
coefficients of any Riemann integrable function. 


14.19 Theorem. [BESSEL’S INEQUALITY]. 
If f : R — Ris (Riemann) integrable on [—x, 2], then ~~, lax(f)|? and 
yo lb (AP are convergent series. In fact, 


2 ©& Lf 
OPE + D(a +r) <= f ifePax. (2) 


k= 


a 
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Proof. Fix N € N. By Lemma 14.18, 


1 ms 
0< -| L(x) — (Sy f(x)2dx 


1 De of 1 af 
= -[ fords — =f feySv fede + al (Sy POO dx 


= =f \fooPax- (ms +¥(ipr + [ef ’)). 


Therefore, 


sale 


N 1 1 
+> (lacpP + ibepr) == fo ifeorPas 


k=1 


for all N € N. Taking the limit of this inequality as N — oo verifies (12). Since 
|f|* is Riemann integrable when f is, it follows that both 77°, lax(f)|? and 
> 21 |be(f)? are convergent series. a 


14.20 Corollary. [RIEMANN-LEBESGUE LEMMA]. If f is integrable on 
[—z, 2], then 


lim ae(f) = lim be(f) = 0. 


Proof. Since the terms of a convergent series converge to zero, it follows from 
Bessel’s Inequality that a,(f) and by (f) converge to zero as k > oo. | 


Our next major result shows that Bessel’s Inequality is actually an identity 
when f is continuous and periodic. First, we show that the partial sums of the 


Fourier series of a function f are the best approximations to f in the follow- 
ing sense. 


14.21 Lemma. 
Let N EN. If fis (Riemann) integrable on [—x, 1] and 


N 
Ty = > + Ds cos kx + d; sinkx) 


is any trigonometric polynomial of degree N, then 


i; 0) -(SuAPds s | Lf) — Tw) Pde. 


—T 
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Proof. Notice by (11) that 

if Lf) — Twa)? dx 
= [- f(x) — (Sw f(x) + (Sw f)@) = Ty (x)? dx 
= : fe) — (Sw fC? dx 


+2 f (Fx) — (Sw AQ) CSn f)&) — Tr (x)) dx 


Hf ie (Sw f)(x) — Ty (x) |? dx 

> is f(x) — (Sw f)(@)P? dx + 2 [sv NENT — f (x) Ty (x)) dx. 
This last term is zero since, by orthogonality, 
- [ : (Sw f)(x) Tn (x) — f(x) Ty (x) dx 


= oe 5 (aul fee + bel fe) 


k=1 


as N - 1s 
Hef FOL f toneos indy 
j= 
N a 
-ye/ f(x) sin jx dx 
“= 7 Jon 


= aoe 4 Yauco + bk (f)dk) 


= (ape ate J act fet + nia) 


k=1 
= 0. 


Consequently, 


/ fe) — TwanPas > f If (x) — (Sw f(x)? dx. 
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14.22 Theorem. [PARSEVAL’S IDENTITY]. 
If f :R > Ris periodic and continuous, then 


lao(f)I? 
2 


. 2 2 if" 2 
+2 (mc +l) == f LfGPdx. (13) 


Proof. By Bessel’s Inequality, we need only show that the left side of (13) 
is greater than or equal to the right side of (13). Since f is continuous and 
periodic, oy f > f uniformly on R as N > oo by Fejér’s Theorem. Hence, it 
follows from Lemmas 14.18 and 14.21 that 


1 ua 2 N 
= fife ax - PE 9% (laf? + cP) 


= k=1 


1 a 1 
= -| If (x) — (Swf)? dx < -| f(x) — (on fy)? dx > 0 


Tt iT 


as N —> ov. In particular, 


1 4 2 ©9 
=f ifeoPax s SOP 4S (lac? + NP). a 


a =] 


~ 


The Riemann—Lebesgue Lemma can be improved if f is smooth and periodic. 
In fact, the following result shows that the smoother f is, the more rapidly its 
Fourier coefficients converge to zero. 


14.23 Theorem. Let f :R — Rand j €N. If f\ exists and is integrable on 
[—2, 2] and f© is periodic for each 0 < t < j, then 


Jim kKla(f) = Jim ki bx(f) = 0. (14) 


Proof. Fix k € N. Since f is periodic, integration by parts yields 


4 


af’) = S f'(x) coskx dx = f(x) sinkx dx = kby(f). 


WT J—x — 


Similarly, by(f’) = —kaz(f); hence ay(f”) = kby(f’) = —k*ax(f). Iterating, 
we obtain 


‘ kia when j is even, 
aga oO ii 
|kJ be (f)| when j is odd. 
A similar identity holds for |b,(f‘)|. Since the Riemann—Lebesgue Lemma 
implies ar(f) and by (f) > 0as k — on, it follows that k/a,(f) > 0 and 
kJ by (f) > Oask > o&. a 
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This result shows that if f is continuously differentiable and periodic, then 
kag(f) and kby(f) both converge to zero as k — oo. Recall that if f is 
continuously differentiable on [—z, 2], then f is of bounded variation (see 
Remark 5.51). Thus it is natural to ask, How rapidly do ka,(f) and kb;(f) 
grow when f is a function of bounded variation? To answer this question, let 
{xo, X1,.--,;Xn} be a partition of [—z, 2]. Using Riemann sums, the Mean Value 
Theorem, Abel’s Formula, and sinkxgo = sinkx, = 0, we can convince our- 
selves that 


Ta(f) = f(x) coskx dx © Y~ f (xj) coskxj (xj — xj-1) 


a = 
1 n 

x k S> f (xj) (sin kx; — sinkxj—1) 
j=l 
n—-1 


1 
= 7 Sj) — ferj+1) sinkx;. 
j=l 


Since the absolute value of this last sum is bounded by Var f, we guess that 
kla(f)| < Var f/s. 

To prove that our guess is correct, suppose for a moment that f is increasing, 
periodic, and differentiable on [—z, 7], and ¢(x) = sinkx. Then, by Defini- 
tion 14.3, periodicity, integration by parts, and the Fundamental Theorem of 
Calculus, we can estimate the Fourier coefficients of f as follows: 


tka (f)| = f(x)6"(x) dx 


=|roe ,- f reoeeas 


= f'(x)b(x) dx 


< i f'(x) dx 


= [> f@dx =o fe) fj) < Var f. 


j=l °%s-1 j=l 


The following result shows that this estimate is valid even when f is neither 
differentiable nor increasing. 


14.24 Lemma. 

Suppose that f and ¢ are periodic, where f is of bounded variation on |—1, 1] 
and ¢ is continuously differentiable on [—1, 7]. If M := supyer_z.7 191, 
then 


| , f (x)o' (x) dx| < M Var f. (15) 


iT 
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Proof. Since f is of bounded variation and ¢’ is continuous on [—z, z], the 
product f¢’ is integrable on [—z, 2] (see Corollary 5.23 and the comments 
following Corollary 5.57). 

Let ¢ > 0 and set C = sup, ej_7,7]|f(x)|. Since ¢’ is uniformly continuous 
and f¢’ is integrable on [—z, 7], choose a partition P = {xo, x1,..., X2n} of 
[—2, 2] such that 


w.c €[xj-1,xj] implies |¢'(w) — ¢/(0)| < ora (16) 
and 

2n 1 ns 

De fwpg'wpej—xj-)- | f@)d'@)ax| < 5 (17) 

j=l 


for any choice of w; € [xj-1, x]. 
Set 


2n 


A= > f (wi @)) — 6@)-1), 


j=l 


where w; = x; when j is even, w; = xj; when j is odd. By the Mean Value 
Theorem, choose cj € [xj—1, xj] such that @(xj) — d(j-1) = P (cj); — 
Xj-1). Then 


2n 


A=) f(ws)o'(ci)(xj — xj-1)- 


j=l 


Hence, it follows from (17) and (16) that 


A — | f(x)d'(x) dx 


2n 2n 
< | > f(wy)o'(ej) ej — xj-1) — D> Fwy)" wy) (Xj — xj—-1) 
j=l j=l 
2n x 
+) fwd wij —xj-)-— | fda) dx 
j=l = 
2n 2 
< DoF le(ep) — 6 wplOyj —xj- + 5 
j=l 


2n 
E é é € 
< — ; = + = t =—*tew 
= ae Xj v4 5 a) é 
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Combining this observation with the Triangle Inequality, we obtain 


| Feow @iar| Sipe. (18) 


On the other hand, by the choice of the w;’s, 


A => f (x2j-2)(G(x2)-1) — O(&2j-2)) + YS f 2) (G02) — 602;-1) 


j=l j=l 


= S> 6 (42j-DCF 02/2) — f (x2;)) 


et 


+ SCF (42))6 aj) — f 2j-2)6 (@2)-2)). 


j=l 


Since f and ¢ are periodic, this last sum telescopes to 0. Therefore, 


Sb (x2j- CF (2j-2) — f (x2;) 


j=l 


|A| = 


< S216 2j-11 IF 2j-2) — f aj)| <M Var. 


j=l 


This, together with (18), proves that 
as 
| <M Varf +e. 


f(x)6"(x) dx 


Taking the limit of this inequality as « > 0, we conclude that (15) holds. 


We now estimate the rate of growth of Fourier coefficients of functions of 
bounded variation. 


14.25 Theorem. Jf f : R — R is periodic and of bounded variation on 
[—z, 2], then 


tf 


V. 
hay ee 
IU 


and |kb(f)| < mt 


fork EN. 


Proof. Fix k € N and set @(x) = sinkx. Then ¢ is periodic and ¢/(x) = 
kcoskx is continuously differentiable on [0,27]. Hence, it follows from 
Lemma 14.24 that 
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1 [7 Var 

kax(f)| = - f (x)k cos kx dx E ae 
T Jn = 


1 TT 

= - Ff (x)'(x) dx 
WT Jon 

A similar argument proves that |kby(f)| < Var f/z. 


EXERCISES 
14.3.1. If f is integrable on [—z, 2] and a € R, prove that 


Tw 
lim if f(x) sin(k +a)x dx = 0. 
k>w J_x 


619 


14.3.2. Prove that there is no continuous function whose Fourier coefficients 


satisfy |ax(f)| > 1/k for k EN. 


14.3.3. Prove that if f : R > R belongs to C?(R) and f, f’ are both periodic, 
then Sf converges to f uniformly and absolutely on R. (See also Exer- 


cise 14.4.5.) 


14.3.4. If f : R > R belongs to C~(R) and f” is periodic for all j > 0, prove 


that Sf is term-by-term differentiable on R. In fact, show that 


j Ogi 
Lo) = yan coskx + by(f) sinkx) 


uniformly for all 7 ¢ N. 


14.3.5. Suppose that f : R > R is periodic on R, integrable on [—z, 7], and 


that ax(f) > Ofork =0,1,.... 


a) Prove that (S; f)(0) = (S; f)(O) for allk > j = 0. 
b) Prove that Sy f(0) < 202y f(0) for N EN. 
c) Prove that )7?° ; lax(f)| < co. 


d) Suppose that f is also even. Prove that f must be continuous and Sf 


converges uniformly and absolutely on R. 


14.3.6. Suppose that f : R — R is continuous and periodic. The modulus of 


continuity of f is defined by 


o(f,6)= sup |f(t+h)— fo). 
ae 


a) Show that 


18 


a(f) = al (sw) — f (w+ £)) cos ku du 


fork EN. 
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b) Prove that 


la(Ml<so(fF) and la(fl<o(f—) 


fork EN. 
c) Use part b) to give a different proof the Riemann—Lebesgue Lemma 
in the special case when f is periodic and continuous. 


14.3.7. a) Compute the Fourier coefficients of f(x) = x. 
b) Prove that 


*14.4 CONVERGENCE OF FOURIER SERIES 


This section uses material from Sections 5.5, 14.2, and 14.3. 

We shall prove that, under certain conditions, a summable series must also 
be convergent. Such results, called Tauberian theorems, will be used to obtain a 
partial answer to the Convergence Question posed in Section 14.1 and further 
results concerning the growth of Fourier coefficients. 

The following result was the first Tauberian theorem discovered. 


14.26 Theorem. [TAUBER]. 
Leta, > Oand LER. If peer ax is Cesadro summable to L, then 


Proof. By Remark 14.11, it suffices to prove that )°72) ax < oo. Suppose to 
the contrary that bare ay = co. Then, given M > 0, there is an ng € N such 
that n > no implies s, := )-¢_9 ax => M. Let N > no. Then 


. pM eProp a Sabah ey ey ay 
N+1 N+1 N+1 


Taking the limit of this last inequality as N — oo, we obtain L > M for all 


M > 0. We conclude that L = oo, a contradiction. a 


This result can be used to improve the Riemann—Lebesgue Lemma for certain 
types of functions. 


14.27 Corollary. Let f : R — R be periodic on R and integrable on [—1, x). If 
ag(f) = Oand by(f) = 0 fork €N, then 


Co 


ye ZS 


k=1 
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Proof. By considering g = f — ao(f), we may suppose that ao(f) = 0. Let 


F(x) = is f(t) dt. 
0 


By Theorem 5.26, F is continuous on R. Since ap(f) = 0, F is also periodic. 
Hence, by Fejér’s Theorem, (oy F)(0) ~ F(0) =O as N — ov. Integrating by 
parts, we obtain 


a(f) _ 
= 


bk (f) 
k 


ar(F) = >0O and b(F)=- 0. 


It follows that )°?° , be(f)/k is Cesaro summable [to —ao(F)/2] and has 
nonnegative terms. We conclude by Tauber’s Theorem that $772, be(f)/k 
converges. | 


We are now in a position to see that the converse of the Riemann—Lebesgue 
Lemma is false. Indeed, if 


were the Fourier series of some integrable function, then, by Corollary 14.27, 


[e,2) 


1 
= klogk 


would converge, a contradiction of the Integral Test. 
The following result is one of the deepest Tauberian theorems. 


14.28 Theorem. [HARDY]. 
Let E © Rand suppose that fy : E — R is a sequence of functions which 
satisfies 


Ik fi(x)| <M (19) 


for allx € E, allk € N, and some M > 0. If ¥°?2o fx is uniformly Cesdro 
summable to f on E, then \~°~.9 fx converges uniformly to f on E. 


Proof. Fix x € E and suppose without loss of generality that M > 1. For each 
n=0,1,..., set 


So(x) + +++ + Sn (x) 
n+1 


’ 


MOS KO. AES 
k=0 


and consider the delayed averages 


622 Chapter 14 Fourier Series 


Sy(X) +++ + Sp4e(X) 
k+1 


On,k(X) i= 


defined for n, k > 0. 
Let 0 < e < 1. Foreachn € N, choose k = k(n) € N such thatk +1 < 
ne/(2M) <k+2. Then 


n—-1 n 2M 


< < <&. (20) 
k+1 k+l g 


Moreover, since 


(Sn(X) = Sn(*)) + +++ + nk) = Sn) 
k+1 


On,k (x) — Sp(x) = 


n+k 


= eS 
-X() 1—*) F300. 


it follows from (19) and the choice of k = k(n) that 


n+k n+k 
nae) HO < YY GIS MY? ; < “os <5. (21) 
j=nt+l j=ntl 
Since o, > f uniformly on E, choose N € N such that 
Pe: 
n>N and xe€E imply |lo,(x)— f(x)| < EEE (22) 


Since 
eis: n—1 n—-1 
On, k\X) = eid On+k pe On-1, 
it follows from (20), (21), and (22) that 
Isn(x) — FO) S [sn ) — On,k OO)! + lone — f)| 


—1 
< 5 + (1 + —) lOn+k(x) i. f (x)| 


n—1l 
+ (7) lon—1(x) — f(x)| 


a 142M 2 2M 2 

<s- 

5 € 12M e \12M 
E 


E E E € € 


for anyn > N andx e€ E. We conclude that s, — f uniformly on E as 
n—> . a 
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We are prepared to answer the Convergence Question posed in Section 14.1 
for piecewise continuous functions of bounded variation. 


14.29 Theorem. [DIRICHLET-JORDAN]. 
If f :R > Ris periodic on R and of bounded variation on [—1x, 1], then 


f(x+) + f@—) 


wim GwAG) = 5 


for every x ER. If fis also continuous on some closed interval I, then 
li = 
Wess Swf f 
uniformly on I. 
Proof. Since f is periodic and of bounded variation, the one-sided limits 
f(xt+) and f(x—) exist for each x € R, and f is Riemann integrable on 
[—2, z] (see the comments which follow the proof of Corollary 5.57). Hence, 


by Fejér’s Theorem, both conclusions hold if Sy is replaced by oy. Since 
Theorem 14.25 implies 


|kag(f) coskx| and = |kby(f)coskx| < 


Var f 
a 


for k € N, it follows from Hardy’s Theorem that both conclusions hold as 
stated. a 


We close this section with an application of Fourier series to an extremal prob- 
lem. We will show that among all smooth simple closed curves in R? with a given 
arc length, the largest area is enclosed by a circle. (The proof presented here 
comes from Marsden [7].) 


14.30 Theorem. [THE ISOPERIMETRIC PROBLEM]. 

Let E be a region in R? whose topological boundary C = 3E is asmooth closed 
simple curve of length 2x. If A = Area(E), then A < a. Moreover, A = x if 
and only if E = By(a,b) forsomea,b eR. 


Proof. Let (v, [0, 27r]) be the natural parametrization of C; that is, ||v’(s)|| = 1 
for all s € [0, 277]. Set 


1 20 1 2n 
a= =f v1(s) ds, b= = f v2(s) ds, 
P(s)=vi(s)—a, Q(s)=v2(s)—b, and (s) = (P(s), Q(s)) 


for s € [0, 27], where (11,2) := v. Clearly, (¢, [0,27]) is a smooth 
parametrization of dE — (a,b) whose trace is a smooth closed simple curve 
with arc length 27 which encloses a region with area A. Moreover, 
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IP(s)? + Q's)? = 1, (23) 
1 Qn 1 Qn 
— P(s)ds =0, — O(s)ds =0, (24) 
2m Jo 2m Jo 


and, by Green’s Theorem, 


20 
A =/f dA =) x dy a, P(s)Q'(s) ds. (25) 
E aE 0 


Let ax, by (respectively, cx, dx) represent the Fourier coefficients of P 
(respectively, Q). Since (@, [0,27]) is smooth and closed, P and Q are 
continuously differentiable and periodic. By (24) and the Dirichlet-Jordan 


Theorem, 
CO CO 

P(s)= Sa cosks + by sinks), O(s) = Sc cosks +d, sinks), (26) 
[e,2) [o,e) 

P'(s) =) (kby cosks — kag sinks), and Q'(s) = )(kdy cosks — key sinks) 
k=l k=l 


(27) 


uniformly on [0, 277]. Hence, by (23) and Parseval’s Identity, 


Qn OF 
Qn = / (IP’(s)? +1Q'(s)?) ds = 2k (ag + bE + cg + dp). 
0 k=l 


Moreover, by (25) and orthogonality 


Qn ed) 
A= i; P(s)O'(s)ds = mY k(agdk — bycr). 
0 


k=1 


It follows that 


CO CO 
us a 
nA => ze — k(az + bz +2 +d?) + 2K = dk)” +(ck + by)”) = 0. 


In particular, A < mw and A = a if and only if aq, = dj, cy) = —b1, and 
an = by = cx = ad = Ofork > 2. 

Suppose that A = z. Then P(s) = a; coss+b; sins and Q(s) = —b; coss+ 
a, sins = —P(s + 5). Thus P’(s) = —Q(s) and Q’(s) = —P”(s) = P(s) for 
all s € [0,27]. It follows from (23) that ¢([0, 27r]) is a subset of 2B, (0, 0). 
Since (0) = ¢(277), we must have ¢([0, 277]) = 0B, (0, 0). Therefore, C is the 
boundary of the disk E = B,(a, b). | 
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EXERCISES 


14.4.1. 


14.4.2. 


14.4.3. 


14.4.4. 


14.4.5. 


Suppose that f is continuous and of bounded variation on [—z, 7]. 
Prove that Sy f — f pointwise on (—z, z) and uniformly on any 
[a,b] C (—7, 2). 

a) Prove that 


pointwise on (—z, 2) and uniformly on any [a, b] C (—7z, 7). 
b) Prove that 


mx 4 \cos(2k — 1)x 
Wa 2D (Qk —1)2 


uniformly on [—z, zr]. 
c) Find a value for 


3 1 
(rk =1) 


Prove that if f is continuous, odd, and periodic, then )°?°, be(f)/k 
converges. 

Let L € R. A series )°7°.9 ay is said to be Abel summable to L if and 
only if 


CO 
lim Scagr* i Be 
k=0 


r>l- 


a) Let S = ae, ax. Prove that 


[o,2) Co oC 
Yiagr® = (ry S Ser* = ry OK + Doge’, 
k=0 k=0 k=0 


provided any one of these series converges for all 0 <r < 1. 

b) Prove that if }°729 ax is Cesaro summable to L, then it is Abel 
summable to L. 

c) Prove that if f is continuous, periodic, and of bounded variation on 
R, then Sf is Abel summable to f uniformly on R. 

d) Show that if a, > O and aes a, is Abel summable to L, then 
yo ak converges to L. 


[BERNSTEIN]. Let f : R — R be periodic and a > 0. Suppose that f is 
Lipschitz of order q; that is, there is a constant M > 0 such that 


[f(x +h) — f(x)| < MIh|* 
forallx,h eR. 
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a) Prove that 


if “hid 2 BD cree ayeee 
© fife + iy = foe Wy dx = AS CaRCL) + ORCA) si? ke 


me k=1 


holds for each h € R. 
b) Ifh =2/2"t!, prove that sin? kh > 1/2 for all k € [2”—!, 2”). 
c) Combine parts a) and b) to prove that 


Qn] We 5 
N+), <M? (=) 
k=2n-1 
forn = 1,2,3,.... 
d) Assuming 

2r—1 2"—1 i 

SY aA + le <2"? | Yo GEA) + RAD) 

k=2n-1 k=2"-1 


(see Exercise 11.7.9), prove that if f is Lipschitz of order a for some 
a > 1/2, then Sf converges absolutely and uniformly on R. 

e) Prove that if f : R > R is periodic and continuously differentiable, 
then Sf converges absolutely and uniformly on R. 


*14.4.6. Suppose that f : R — R is periodic and of bounded variation on 
[—z,2]. Prove that Sy f — f almost everywhere as N — oo (see 
Exercise 14.2.8). 


*14.5 UNIQUENESS 


This section uses material from Section 14.4. 
In this section we examine the Uniqueness Question posed in Section 14.1. 
We begin with the following generalization of the second derivative. 


14.31 Definition. 


Let xo € R and let J be an open interval containing xo. A function F:] > R 
is said to have a second symmetric derivative at xo if and only if 


F (xo + 2h) + F(xo — 2h) — 2F (x0) 
4h? 


DoF a) = ig, 


exists. 


14.32 Remark. Let x9 € R and let I be an open interval containing xo. If F is 
differentiable on I and F" (xq) exists, then F has a second symmetric derivative at 


XO and D> F (xo) = F" (xo). 
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Proof. Set G(t) = F(xo + 2t) + F(xo — 2t) fort € J and H(t) = 4? and fix 
t € I. By Theorem 4.15 (the Generalized Mean Value Theorem), 


F(xo + 2t) + Fo — 2t) —2F (xo) — GX) - GO) _ Ge) 
4t2 ~ H(t)—H()  H'(c) 
F'(xo + 2c) — F’(xo — 2c) 
- 4c 


for some c between 0 and ¢. Since c > Oast — 0, it follows that 


F'(xo + 2c) — F’(xo9 — 2c) 


D2F (xo) = Jiny 


Ac 
1. F'(xo + 2c) — F'(x9) — -F' (xo) — F’ (xo — 2c) 
= -— lim + 
2 c30 2c 2c 
1 
= ~(F" (x0) + F"(x0)) = F" (x0). 
2 r] 


The converse of Remark 14.32 is false. Indeed, if 


1 x >0O 
F(x)= 0 x =0 
—1 x <0, 


then DF (0) = 0 but F”(0) does not exist. 

The following result reinforces further the analogy between the second 
derivative and the second symmetric derivative (see also Exercises 14.5.1 
and 14.5.5). 


14.33. Lemma. 

Let [a, b] be a closed bounded interval. If F : [a,b] — R is continuous on 
[a, b] and D2 F (x) = 0 for all x € (a, b), then F is linear on [a, b); that is, there 
exist constants m, y such that F(x) =mx + y forall x € [a, b]. 


Proof. Let ¢ > 0. By hypothesis, 


F(b) — F(a) 
b-— 


b(x):=F (x) — F@) ( Jo a) + e(x —a)(x — b) 


is continuous on [a, b], and, by Remark 14.32, 
Do6(x) = DoF (x) + 2e = 2e (29) 


for x € (a, b). 

We claim that (x) < 0 for x € [a, b]. Clearly, d(a) = o(b) = 0. If d(x) > 0 
for some x € (a,b), then ¢ attains its maximum at some x9 € (a,b). By 
Exercise 14.5.1, Dod(xo) < 0; hence, by (29), 2e < 0, a contradiction. This 
proves the claim. 
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Fix x € [a, b]. We have shown that 


F(b) — F(a) 
b-a 


F(x) — F(a) ( Jo a) <e(x —a)(b— x). 


A similar argument establishes that 


F(x) — F(a) (Sar) a) > —8(x —a)(b — x). 
Therefore, 
| F(x) — F@) (Se a) |< e(x —a)(b—x) < e(b—a)”. 


Taking the limit of this inequality as e > 0, we conclude that 


F —f 
Fis) = Fa) + ( ®) ole a) 


b-— 


for all x € [a, b]; that is, F is linear on [a, b]. a 


14.34 Definition. 


The second formal integral of a trigonometric series, 


[o,2) 
do . 
S=— ) k b kx), 
5 + (ax cos kx + by sinkx) 


k=1 
is the function 
a > sca | 
F(x)= 4° _ Dig coskx + by sinkx). 


By the Weierstrass M-Test, if the coefficients of § are bounded, then the sec- 
ond formal integral of S converges uniformly on R. In particular, the second 
formal integral always exists when the coefficients of S converge to zero. 

Notice that the second formal integral of a trigonometric series S is the result 
of integrating S twice term by term. Hence, it is not unreasonable to expect that 
two derivatives of the second formal integral F might recapture the original 
series S. Although this statement is not quite correct, the following result shows 
that there is a simple connection between the limit of the series S and the second 
symmetric derivative of F. 
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14.35 Theorem. [RIEMANN]. 
Suppose that 


CO 
ao : 
s=— ) k b k 
5) nee x + be sinkx) 


is a trigonometric series whose coefficients ax, by — 0 as k — oo and let F be 
the second formal integral of S. If S(xo) converges to L for some xo € R, then 
D2 F (xo) = 


Proof. Let Fy denote the partial sums of F. After several applications of 
Theorem B.3, we observe that 


km Fn (xo + 2h) + Fn (xo — 2h) — 2F (xo) 
h->0 Ah2 


; sinkh 
= jim (3 + xe cos kxo + by sinkxo) ( th ) ) 


k=1 


N 
= + 2 (0 cos kxo + by sinkxg) 


holds for any N € N. Therefore, it suffices to show that given ¢ > 0 there is 
an N € N such that 


CO 
kh 
[Ry l:= rE Ze cos kxo + by sinkxo) (— ) <€é (30) 


for all |h| < 1. 
Let 


= sinkh \? 
by (aj cos jxo +b; sin jxo) and Be= ( ih ) 
jok+ 


fork € N. Since A, — 0 asn > ov, we have by Abel’s Formula that 


= lim > (Ag—1 — Ax) By 


n>C 
k=N+1 
n—-1 
= lim | (Ay — An)Bn— > (Ay — Ag)(Be+1 — Be) (31) 
noo koN+1 


Co 
=AnByiit >> Ax(Beyi — By). 
k=N+1 
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Moreover, by the Fundamental Theorem of Calculus, 


k+Dh q /sint\? 
hah 
kh dt\ t 
d /(sint =. Dsint tcost — sint 
dt\ 4 7 38 t2 


is bounded near t = 0 and is bounded by 2(t + 1)/t? < 2/1? for t > 2, it is 
clear that the improper integral 

d (sint\? 

dt\ t 


=| 


converges. Since {B,} is bounded and Ay — 0 as N —> ov, we can choose an 
N € Nsuch that 


|Brzi — Bel = (32) 


Since 


dt 


(AgBaait 5 and k>N implies |Ag| < (33) 


It follows from (32) that 


> Mies os ee d (=) a 
sae Pee oe pees 
k=N+1 2€ k=n 1 |Y kA dt\ t 
E [ d (sint\* € 
< dt=-. 
2C Jo |dt t 2 


Combining this inequality with (31) and (33), we conclude that |Ry|<e«. Mf 


The following result shows that the hypotheses of Riemann’s Theorem are 
satisfied by any trigonometric series which converges pointwise on a nondegen- 
erate interval. 


14.36 Theorem. [THE CANTOR-LEBESGUE LEMMA]. 


If 


CO 
SS eal + Sax cos kx + by sinkx) 


2 
k=1 


is a trigonometric series which converges pointwise on a nondegenerate interval 
La, b], then its coefficients satisfy ax, by > 0 ask — ov. 


Proof. Set po = ag/2 and pp = ap + be fork € N. If the result is false, then 
there is ad > O such that p; > 6 for infinitely many k € N. 
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Set 09 = 0 and for each k € N define & € Rso that ag = px coskO, by = 
px sink. By a sum-angle formula, 


ao 


n n 
5 + Sax cos kx + by sinkx) = >»: pRcos k(x — O) 


k=] k=0 
for each x € Randn EN. Since S converges on [a, b], it follows that 


lim px cosk(x — 6.) =0 (34) 
k->oo 


for all x € [a, b]. 

Set Jo = [a,b] and ky = 1. Fix j > O and suppose that a closed interval 
I; ¢ Ip and an integer kj > ko have been chosen. Choose kj4, > kj such that 
kjsillj| > 20 and Pk ja, > 8. Clearly, kj+1(x — Ox ;,,) uns over an interval 
of length > 2m as x runs over /;. Hence, we can choose a closed interval 


Tj41 G1; such that 


1 
xeE Tj41 implies cos kj44(x _ Ok j41) => 3" 


By induction, then, there exist integers 1 < k; < ky < ... and a nested 
sequence of closed intervals Jj) > 1; D> ... such that 


rf) 
Pk; COSK (x — O;) = 5 (35) 


for x € Ij, j € N. By the Nested Interval Property, there is an x € 1; 
for all j € N. This x must satisfy (35) for all 7 € N and must belong to 
[a, b] by construction. Since this contradicts (34), we conclude that p, > 0 as 
k> o. | 


We are now prepared to answer the Uniqueness Question for continuous 
functions of bounded variation. 


14.37 Theorem. [CANTOR]. 
Suppose that 


CO 
a0 : 
S=—+ Sa coskx + by sinkx) 
2 
k=1 
converges pointwise on |—z, 1] to a function f which is periodic and continuous 


on R, and of bounded variation on [—1, 1]. Then S is the Fourier series of f; 
that is, ay = ag(f) fork =0,1,..., and by = by (f) fork =1,2,.... 


Proof. Suppose first that f(x) = 0 for all x ¢ R. By the Cantor—Lebesgue 
Lemma, the coefficients a;, b, tend to zero as k — oo. Thus the second formal 
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integral F of S is continuous on R and by Riemann’s Theorem has a second 
symmetric derivative which satisfies D2 F(x) = 0 for x € R. It follows that F 
is linear on R; that is, there exist numbers m and y such that 
a ae | 
mx+y= 0 2 Doe (az cos kx + by sinkx) 
k=1 


for x € R. Since the series in this expression is periodic, it must be the case 
that m = ag = 0; that is, 


CO 
1 
y+ AG coskx + by sinkx) = 0 
k=1 


for all x € R. Since this series converges uniformly, it follows from 
Theorem 14.4 that y = 0 and a, = by = 0 for k € N. This proves the theorem 
when f = 0. 

If f is periodic, continuous, and of bounded variation on [—z, wz], then 
Sy f — f uniformly on R by Theorem 14.29. Hence, the series S — Sf con- 
verges pointwise on R to zero. It follows from the case already considered 
that a, —a,(f) =O fork =0,1,..., andb, —by(f) =O fork =1,2,.... I 


EXERCISES 


14.5.1. Suppose that F : R > R has a second symmetric derivative at some xo. 
Prove that if F (xq) is a local maximum, then D> F (xo) < 0, and if F (xo) 
is a local minimum, then Dy F (xo) > 0. 

14.5.2. Prove that if the coefficients of a trigonometric series are bounded, then 
its second formal integral converges uniformly on R. 

14.5.3. Prove that if f : R — R is periodic, then there exists at most one 
trigonometric series which converges to f pointwise on R. 

14.5.4. Suppose that f : R — R is periodic, piecewise continuous, and of 
bounded variation on R. Prove that if S is a trigonometric series which 
converges to (f(x+) + f(x—))/2 for all x € R, then S is the Fourier 
series of f. 

*14.5.5. Suppose that F : (a,b) — R is continuous and D2F(x) > 0 for all 
x € (a, b). Prove that F is convex on (a, b). 


Appendices 


A. ALGEBRAIC LAWS 


In this appendix we derive several consequences of the ordered field axioms 
(i.e., Postulates 1 and 2 in Section 1.2) and show that by Postulate 3, if t > 0, 
then /t ER. 


A.1 Theorem. Letx,acR. 


i) Ifa=x +a, thenx =0. 
ii) Ifa =x-aanda £0, thenx =1. 


Proof. i) Since the additive inverse of a exists, we can add —a to the equation 
a =x +a. Using the Associative Property and the fact that 0 is the additive 
identity, we obtain 


0=a-+ (-—a) = (x +a) + (—a) = x+ (a+ (—a)) =x+0=x. 


ii) Since the multiplicative inverse of a exists, we can multiply a = x - a by 
a~'. Using the Associative Property and the fact that 1 is the multiplicative 
identity, we obtain 


l=a-a!=(x-a)-a!=x-(a-a')=x-1l=x. a 
Theorem A.1 shows that the additive and multiplicative identities are unique. 
The following result shows that additive and multiplicative inverses are also 


unique. Thus unique can be dropped from the statements in Postulate 1. 


A.2 Theorem. 


i) Ifa,b € Randa+b =O, then b = —a. 
ii) Ifa, b € Rand ab = 1, thenb =a". 


Proof. i) By hypothesis and the Associative Property, 
—a=-—a+(a+b)=(-a+a)+b=0+D=bD. 


ii) Since 1 4 0, a 4 0. Thus it follows from hypothesis and the Associative 
Property that 


alt= a '(ab) = (a~!a)b =1-b=b. 1:2 


A.3 Theorem. For all a,b € R, 0-a=0, —a = (—1)-a, —(—a) =a, (—1)? = 1, 
and —(a — b) =b—a. 
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Proof. Since 1 is the multiplicative identity and 0 is the additive identity, it 
follows from the Distributive Property that 


a+0-a=1-a+0-a=(1+0)-a=1-a=a. 
Hence, by Theorem A.1, 0-a = 0. Similarly, 


a+(-l)-a=(U4+(-1))-a=0-a=0. 


Since additive inverses are unique, it follows that (-1)-a = —a. Since 
—a+a = a+(-a) = 0, a similar argument proves that —(—a) = a. 
Substituting a = —1, we have 


(-1)(-1) = -(-I) = 1. 


Finally, for any a, b € R, we also have 


(a —b) = (-1)(a— 5) = (-Dat+ (-1)(-)) = -a+b=b-a. a 


A.4 Theorem. Leta,b,c <R. 
i) Ifa-b=0, thena=O0orb=0. 
ii) Ifa-b=a-canda £0, thenb=c. 


Proof. i) Ifa = 0, we are done. If a 4 0, then multiplying the identity 0 = a-b 
by a—!, we have 


O=a°':0S0 '. sb) =@"! +a): P= 1-0 Sd. 
ii) Ifa-b =a-c, then by Theorem A.3 we have 
a:-(b—c)=a-(b+(-lc) =a-b4+(-lDa-c=a-b—-a-c=0. 
Since a # 0, it follows from part i) that b — c = 0 (i.e., b =). a 


A subset £ of R is called inductive if 


lekE 
and 


for every x € E, x + 1 also belongs to E. (1) 


Notice by Postulate 1, R is an inductive set. 

Define N to be the set of elements which belong to ALL inductive sets, and 
set Z := {k € R: k € N,—-k EN, ork = O}. Notice that N is the smallest 
inductive set (i.e., N is inductive), and N C E for any inductive set E. Indeed, 
since | belongs to all inductive sets, 1 ¢ N. Ifk € N and E is any inductive set, 
then k € E. Since E is inductive, it follows that k + 1 € E for all inductive sets 
E,sok+1 EN. Finally, if k ¢ N and E is inductive, then by definition, k € E. 
Thus N C E. 
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Since N is the smallest inductive set, it automatically satisfies the Principle of 
Mathematical Induction. Indeed, if E C N satisfies “1 ¢ E" and “k € E implies 
k+1 € E," then E£ is an inductive subset of the smallest inductive set N and, 
hence, must be equal to N. 

It is now fairly easy to prove that N and Z, as defined above, satisfies the 
assumptions made in Remark 1.1. We begin with an easy consequence of the 
Trichotomy Property. 


A.5 Lemma. 
Given n € Z, one and only one of the following statements holds: n € N, 
—n EN, orn=0. 


Proof. Since (0, oo) is an inductive set, all elements of N are positive. By the 
definition of Z, given n € Z, one of the following statements holds: n € N, 
—n EN, orn = O. It follows that either n > 0,n < 0, orn = 0. Since the 
Trichotomy Property implies that only one of these conditions can hold for a 
given n, the lemma is proved. | 


Next, we show that N and Z satisfy the assumptions made in Remark 
1.1ii and iii. 


A.6 Theorem. 


i) Ifn € Z, thenn € Nif and only ifn > 1. 
ii) Non € Zsatisfies0 <n <1. 


Proof. i) Since [1, co) is an inductive set, N C [l,0oo). Thus everyn € N 
satisfies n > 1. 

Conversely, ifn € Zandn > 1, thenn 4 0so by Lemma A.5, either n ¢ N 
or —n € N. But if —n € N, then by what we’ve already proved, —n > 1 
so by the Second Multiplicative Property and Example 1.2,n < —1 < l,a 
contradiction of the Trichotomy Property. We conclude that n € N. 

ii) Ifn € Zandn > 0, then by Lemma A.5,n € N. Hence by part i), n > 1 
so it cannot satisfy n < 1. | 


We need two more preliminary results before we prove that Z is closed under 
addition, subtraction, and multiplication. 


A.7 Lemma. 
[fn éZ, thnn+1€Z. 


Proof. Fixn € Z. By Lemma A.5,n = 0, orn € N, or —n EN. 

Ifn = 0, then n + 1 = +1 belong to Z by definition. 

Ifn € N, thenn+1¢N C Z because N is an inductive set. On the other 
hand, since A = {k € N: k —1 € Z} is obviously an inductive set, we have 
A=N (ie.,n € Nimplies n — 1 € Z). 
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Finally, if —n € N, then by what we just proved, —n+1 € Z, so by definition, 
n+t1=—-(-n#F1)€Z. a 


Next, we prove that N is closed under addition and multiplication. 
A.8& Theorem. [fn,m €N, thenn-+m and nm both belong toN. 


Proof. Fix n € N, and consider the set A := {m € N:n+m &N}-. Since N is 
an inductive set, 1 ¢ A. Ifm € A for some m > 1, thenn +m EN. Since N is 
inductive, it follows that n + (m+1) = (n+m)+1€N (ie.,m+1 € A). Thus, 
by induction, A = N and closure holds for addition. 

For multiplication, consider B = {m € N: nm € N}. Clearly, 1 € B. If 
some m € B (i.e., mn € N), then n(m + 1) = nm +n also belongs to N since 
we have already proved that N is closed under addition. Thus m € B implies 
m-+1e B. By induction B =N. a 


We are now prepared to prove that Z is closed under addition, subtraction, 
and multiplication. 


A.9 Theorem. Ifn,m € Z, thenn +m, n—m, and nm all belong to Z. 


Proof. Fixn € Z. By Lemma A.5,n = 0, orn € N, or —n EN. 

Suppose that n = 0. Then for all m € Z,n +m =+m € Zandnm =0€ Z. 

Suppose that n € Nandm e Z. By Lemma A.5, either m € N, m = 0, 
or —m EN. In the first two cases, we have by Lemma A.8 or a triviality that 
n +m and nm both belong to N, hence also to Z. If —m € N, then consider 
the sect A = {k EN: n-—k ©€ Z}. By Lemma A.8, 1 € A. Ifk € A, then 
n—(k+1) = (n—k) —1 € Z by the Inductive Hypothesis and Lemma A.8 
again. By induction, then, A = N (ie.,n —k € Z for all k € N). In particular, 
n—m € Zas promised. A similar argument shows that nm € Z. 

Finally, if —n € N, then by what we just proved, (—n) + m and (—n)m € Z 
for all m € Z. Since Z is closed under multiplication by —1, we conclude that 
n+m and nm belong to Z. a 


Our next result shows that the Complete Ordered Field axioms (Postulates 
1, 2, and 3) guarantee that every positive real number has a square root. The 
same method can be used to show that if t > 0 andn EN, then ¢/f exists. Thus 
r”/" :— ./t™ can be defined using only the Complete Ordered Field axioms. 


A.10 Theorem. Jf t > 0, then ./t exists; that is, there is a number b € (0, 0) 
which satisfies b? = t. 


Proof. Our proof has two steps. 

Step 1. A perfect square can always be made a little bigger or a little smaller; 
namely, given a <b? <c there exist real numbers x and y such that x <b < y 
and 


Cex <b ey 2E. 
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To prove this, let ¢ = min {0.5, (c — b*)/(2b + 1)} and set y = b+ ¢. Since 
b > Oand b* < c, the number «¢ is positive. Thus b < y. Since ¢ < 1, we also 
have ¢? < e. Therefore, 


y? =b?+2eb+e* <b? +2eb+e = b? + (2b + le. 
Since ¢ < (c — b”)/(2b+ 1) implies (2b + l)e <c— b?, it follows that 
y <b +(2b4+ le <b? +(c—b’) =c. 


We have proved that b? < y” <c. 

A similar, but simpler, proof establishes the existence of x. First, set x = 
b — e, where ¢ = (b* — a)/2b, and observe by hypothesis that e > 0. Hence 
x < b. Next, notice that 


x? = b* —2eb+ &* > b* —2eb +0 =b? —2eb. 


Since 2eb = b* — a, it follows that x? > b? — (b* — a) =a. 

Step 2. If E = {x € R: x > Oand x* < t}, then b = sup E exists and is the 
square root of t. 

To show that E has a supremum, notice first that by the Archimedean 
Principle, there is ann € N such that n > 1/t. Thus E is nonempty since 
x =1/n € E. Next, observe that E is bounded above by max{f, 1}. Indeed, 
ift < landx € E satisfies x > 1, thent > x* > 12 = 1, a contradiction. 
Thus E is bounded above by 1 when tr < 1. On the other hand, if t > 1 and 
x € E satisfies x > t, thent > x* > t? > t, acontradiction. Thus E is bounded 
above by t when t > 1. 

We have proved that E is nonempty and bounded above. Thus by the Com- 
pleteness Axiom, b := sup E exists and is finite. It remains to verify b* = t. 

Suppose to the contrary that b* 4 t. By the Trichotomy Property, either 
b* < t or b* > t. We shall show that both these assumptions lead to contra- 
dictions. 

Case 1. b* < t. Apply Step 1 with c = t to choose a y > b such that y? < ¢. 
Since y* < t, y € E. Since b = sup E, it follows that y < b. This contradicts 
the choice that y > b. 

Case 2. b* > t. Apply Step 1 with a = t to choose an x < b such that t < x’. 
Let w € E (ie., w > 0 and w? < 1). Since t < x’, the Transitive Property 
implies w* < x. This last inequality implies that w < x. Indeed, if w > x, 
then by squaring both sides we have w* > x”, a contradiction. 

We have proved that w < x for all w € E (i.e., x is an upper bound of E). 
Since b is the supremum of EF, we conclude that b < x, a contradiction of the 
choice that x < b. a 


We close this section by proving that, under mild assumptions, the Axiom of 
Induction and the Well-Ordering Principle are equivalent. 
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A.11 Theorem. Suppose that the Ordered Field Axioms hold and that 
neEN and n#1 implyn—-1€EN. (2) 


Then the Axiom of Induction holds if and only if the Well-Ordering Principle 
holds. 


Proof. By the proof of Theorem 1.11 [which used (2) at a crucial spot], the 
Well-Ordering Principle implies the Axiom of Induction. 

Conversely, if the Axiom of Induction holds, then the elements of N belong 
to every inductive set. This is the “definition" of N we made following (1) 
above. It follows that all results above are valid. (We will use Theorem A.6.) 

Suppose that E is a nonempty subset of N, and consider the set 


A:={xeEN:x<e forall ec E}. 


A is nonempty since 1 € A. A is not the whole set N since if eg € E, then 
eo + 1 cannot belong to A. Hence, by the Axiom of Induction, A cannot be an 
inductive set. In particular, there is an x € A such thatx +1 ¢ A. 

We claim that this x is a least element of £; that is, x is a lower bound of 
E and x € E. That x is a lower bound of E is obvious, since by construction, 
x € Aimplies x < e for all e € E. On the other hand, if x ¢ FE, then x < e for 
alle € E. Hence, by Theorem A.6ii, x + 1 < e foralle € FE (ie.,x+1¢A),a 
contradiction of the choice of x. | 


B. TRIGONOMETRY 


In this appendix we derive some trigonometric identities by using elementary 
geometry and algebra. 

Let (x, y) be a point on the unit circle x + y? = 1 and 6 be the angle mea- 
sured counterclockwise from the positive x-axis to the line segment from (0, 0) 
to (x, y) (see Figure B.1a). [We shall refer to (x, y) as the point determined by 
the angle @.] Define 


sind = y, cos9 =x, and une 
x 


By the Law of Similar Triangles, given a right triangle with base angle 0, altitude 
a, base b, and hypotenuse h (see Figure B.1b), sin@ = a/h, cos@ = b/h, and 
tan@ = a/b = sin@/cos@. 


B.1 Theorem. Given a circle C : x? + y? =r? of radius r, let s(@) represent the 
length of the arc on C swept out by 0, and A(@) represent the area of the angular 
sector swept out by 0 (see Figure B.la). If the angle 0 is measured in radians (not 
degrees), then 


r20 
s(@0)=ré and A(@)= ae 
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S-4-7- 


FIGURE B.1a 


FIGURE B.1b 


Proof. Since there are 27 radians in a complete circle and the circumference 
of a circle of radius r is 27r, we have 


s(0)_ 6. 


Qnr = 2m’ 
that is, s(9) = r@. Similarly, since the area of a circle is mr, we have 


AO) 0. 


’ 


wr? Qn 
that is, A(@) = r76/2. | 


B.2 Theorem. 
i) sin(O) = 0 and cos(0) = 1. 

ii) For any 6 € R, |sin6| < 1, |cos6| < 1, sin(—@) = —sin6, cos(—0) = cos6, 
and sin? 6 + cos? 6 = 1. 

iti) If 6 is measured in radians, then sin(z/2) = 1, cos(z/2) = 0, sin(@ + 27) = 
sin@, and cos(@ + 277) = cos 6. Moreover, if0 <6 < 1/2, then0 < @cosé < 
sin@ < 6. 

iv) If@ € Ris measured in radians, then | sin9| < |@|. 
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Proof. Let 6 € R and (x, y) be the point on the unit circle determined by 6. 
i) If 6 = O, then (x, y) = (1,0) (see Figure B.1a). Hence, sin(0) = 0 and 


cos(O) = 1. 


ii) Clearly, | sin@| = |y| = Vy? < /x?+ y? = 1, and, similarly, | cos6| < 1. 


By definition (see Figure B.2a), sin(—@) = —y = —sin@ and cos(—@) = 
x = cos@. Moreover, 


sin? @ + cos*?@ =x*+y*= 1. 


A 


1 


FIGURE B.2a 


iii) If 90 = w/2, then (x, y) = (0, 1), so sin(z/2) = 1 and cos(z/2) = 0. Fix 


6 € (0, 2/2) and consider Figure B.2b. Since sin 0 is the altitude of triangle 
ABC and the shortest distance between two points is a straight line, we 
have by Theorem B.1 that 


sind < s(0) =8@. 


On the other hand, the triangle ABC is a proper subset of the angular sec- 
tor swept out by @, which is a proper subset of the triangle ABD. Hence, 


Area (ABC) < A(@) < Area(ABD). 


Since the area of a triangle is one-half the product of its base and its 
altitude, it follows from Theorem B.1 that 


sin 0 2 0 - tan (3) 
2 2 2 


But 0 < cosé < 1 for all @ € (0,2/2). Multiplying (3) by 2cos6, we 
conclude that 


sinO cos@ < @cos@ < sind. (4) 
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FIGURE B.2b 


iv) By part iii), | sin@| = sind < 6 = |6| for all 0 < 6 < 7/2. Since sin(—0@) = 
— sin, it follows that |sin@| < |6| for all 6 € [—72/2,7/2]. Butif 6 ¢ 


[—2/2, 2/2], then | sin@| < 1 < w/2 < |6|. Therefore, | sin@| < |@| for all 
OER. | 


The next result shows how to compute the sine and cosine of a sum of angles. 


B.3 Theorem. 
i) [SUM-ANGLE FORMULAS]. If 6, g@ € R, then 
cos(@ + yg) = cos@ cosg + sin sing, 
and 
sin(@ + y) = siné cosy + cos@ sing. 


ii) [DOUBLE-ANGLE FORMULAS ]. If 6 € R, then 


I 
prey eis 
2 
1 — cos(26 
ieee SS 


and 
cos@ = 1 —2sin?(6/2). 


iti) [SHIFT FORMULAS ]. If g is measured in radians, then 


(5 ) 
sing =cos|—— 9), 
gy ) gy 


642 Appendices 
and 
ne) 
cos g = sin (— — 
y 7 & 
forallg ER. 


Proof. Suppose first that @ > g. Consider the chord A cut from the unit circle 
by a central angle 6 — gy, and the chord B cut from the unit circle by a central 
angle y — @ (see Figure B.3). Since sin? 6 + cos* 6 = 1, we have 


A* = (cos@ — cos v)* + (sind — sin gy = 2-—2(cosOcosg+sin@sing), (5) 


and 


B? = (cos(6 — g) — 1)? + (sin(6 — g))* = 2 — 2cos(0 — 9). 


AY 


(cos @, sin @) 


(cos®, sin) 


#Y 


FIGURE B.3 


Since |9 — g| = |g — 4|, the lengths of these chords must be equal. Thus 

cos(@ — yg) = cos@ cosg + sind sing (6) 
for @ < g. A similar argument establishes (6) for g < 6. Since (6) is trivial 
when @ = 9, we have proved that (6) holds for all 6 and y. Combining the 
identities sin(—@) = — sin@ and cos(—@) = cos @ with (6), we obtain 


cos(@ + y) = cos(@ — (—@~)) = cos@ cos g — sin@ sing. 


This and (6) verify the first identity in part i). 
Applying this identity to 6 = 2/2, we see by Theorem B.2ii that 


(5 ) (5) . i (5) 
cos | — — g) =cos(—) cos sin { —) sing = sing; 
0 5) p >) yg ) p Q; 


Section C. Matrices and Determinants 643 


that is, the first identity in part iti) holds. Combining the first identities in parts 
i) and iii), we obtain 


sin(@ + ~) = cos (S —O0)F ) 
= cos(5 — 0)cos(—@) F sin(S — 0)sin(—g) 


= sind cosy + cos@ sing. 


This proves the second identity in part i). Specializing to the case 0 = 1/2, we 
obtain sin(z/2 — y) = cos gy. Thus parts i) and iii) have been proved. 
To establish part ii), notice by part i) and Theorem B.2 that 


cos(20) = cos(@ + 6) = cos’ 6 — sin? 6 = 2cos”6 — 1. 


Hence, cos?6 = (1 + cos(26))/2. Similar arguments establish the rest of 
part ii). a 


We close this section with the Law of Cosines, a generalization of the 
Pythagorean Theorem. 


B.4 Theorem. [LAW OF COSINES]. 
If T is a triangle with sides of length a, b, c, and 0 is the angle opposite the side 
of length c, then 


c =a’ +b* —2abcosé. 


Proof. Suppose without loss of generality that 6 is acute, and rotate T so b is 
its base. Let h be the altitude of T, and notice that / cuts a right triangle out 
of T whose sides are a and h and the angle opposite h is 0. By the definition 
of sin@ and cos 0, h = asin@ and the length d of the base of this right triangle 
is d = b — acos@. Substituting these values into the equation c? = h? + d? 
(which follows directly from the Pythagorean Theorem), we obtain 


c? = (asin6)* + (b —acos@)” 


=a’ sin? 6 +. a* cos? 6 +b? — 2abcos@ = a* + b* — 2abcosé. |_| 


C. MATRICES AND DETERMINANTS 


In this appendix we prove several elementary results about matrices and deter- 
minants. We assume the student is familiar with the concept of row and column 
reduction to canonical form. 
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Recall that an m x n matrix B is a rectangular array which has m rows and n 
columns: 


by, bya...) Dy 

ba, baa sw. oy, 
B= [bij \mxn — : : F A 

bmi bm2 tae bmn 


The notation 5;; indicates the entry in the ith row and jth column. We shall call 
B real if all its entries b;; belong to R. 


C.1 Definition. 


Let B = [bij|mxn and C = [cxe]pxq be real matrices. 
i) B and C are said to be equal ifm = p,n = q, and bj; = cj; fori 

2,...,m,and j =1,2,...,n. 

ii) The m x n zero matrix is the matrix O = Om xn = [bij]mxn, Where bj; = 0 
fori=1,...,m,and j =1,...,n. 

ili) The n x n identity matrix is the matrix I = Inyn = [bij |nxn, Where bj; = 1 
fori =1,.....,n, and bj = Ofori A], tj =1,.02,.n: 

iv) The product of a matrix B and a scalar a is defined by 


= 


aB= [bij |mxn- 


v) The negative of a matrix B is defined by —B = (—1)B. 
vi) When m = p andn = q, the sum of B and C is defined by 


B+C= [bij + Cij|mxn- 


vii) When n = p, the product of B and C is defined by 


n 
BC = bs ova] 
v=1 


mxq 


C.22 EXAMPLE. 
Compute B + C,3B,—C, BC, and CB, where 


1 O —-1 1 
Be) | and Cals AR 
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SOLUTION. By definition, 


1 -1 -—1 1 1 3 
-c=(} alk se=|"§ Al and ene | al Ba 


These operations do not satisfy all the usual laws of algebra. (For example, 
the last two computations show that matrix multiplication is not commutative.) 
Here is a list of algebraic laws satisfied by real matrices. 


C.3 Theorem. Let A = [a;;], B = [bjj], and C = [c;;] be real matrices and a, B 
be scalars. 


i) (@+ B)C =aC+ BC. 

ii) If B+ C is defined, thena(B +C)=aB+aC,andB+C=C+B. 

iti) If BC is defined, then a(BC) = (aB)C = B(aC). 

iv) If AB and AC are defined, then A(B + C) = AB+ AC. If BA and CA are 
defined, then (B+ C)A = BA+CA. 

v) IfA+ Band B+C are defined, then (A+ B)+C =A+(B+C). If AB and 
BC are defined, then (AB)C = A(BC). 

vi) If Bisanm x n matrix, then 


B+ Omxn = B, B—B=Onxn, 
BOnxq = Omxq> OpxmB = Opxn; and OB = Onmxn. 


vii) If B isann x n matrix, then 
InxnB = BlInxn = B. 
Proof. By definition, 
(a + B)C = [a+ B)cij] = [acij + Bcij] =aC + BC, 
and 
a(B+C) = a[bjj + cj] = [a(bij + cij)] = [edi] + lacjj] =aB+ac. 


A similar argument establishes B+ C =C+B. 
Let B be an m x n matrix and C be ann x q matrix. By definition, 


a(BC) = fe She = [Sarr = (WB)C. 
v=1 v=1 


A similar argument establishes a(BC) = B(a@C). 
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Let A be anm x n matrix and B, C ben x qg matrices. By definition, 


n 


n n 
A(B+C) = De div (byj + a] = bs aivbyj + Saver] = AB + AC. 
v=1 v=1 


v=1 


A similar argument establishes (B + C)A = BA+CA. 
Let A be anm x n matrix, B be ann x p matrix, and C be a p x q matrix. 
By definition, 


II 


(AB)C 


bp» cs [cjK] 
v=1 

P n 

> (Yeh) C jk 


j=l \v=1 
n P 
=| | aan) bugeae 
v=l1 j=l 
P 
=. bce = A(BC). 
j=l 


A similar argument establishes (A + B) +C =A+(B+C). 
By definition, 


B+ Omxn = [bij + 0) = [bij] = B, B-B=([bij — bij) = Onxn, 
n m 

BOnxq =a pe biv | = Omxqs OpxmB = 0-65 = Opxn; 
al v=1 


and 0: B = [0- bij] = Omxn. And, since J = [6;;], where 


1 i= j, 
oj = . z 
0 iF j, 
wehaved sn =(3")4 oie | = 077 |= BS Bie. a 


A square matrix is a matrix with as many rows as columns. Clearly, if B and C 
are square real matrices of the same size, then both B + C and BC are defined. 
This gives room for more algebraic structure. Ann x n real matrix B is said to 
be invertible if and only if there is ann x n matrix B-!, called the inverse of B, 
which satisfies 


BB-!=B-!Be=I. 


The following result shows that matrix inverses are unique. 
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C.4 Theorem. Let A, B ben xn real matrices. If B is invertible and BA = I, 
then B-! = A. 


Proof. By Theorem C.3 and definition, 


Bo! =B'1=B7'(BA) =(B'B)A=IA=A. | 


If B = [bij ]nxn is Square, recall that the minor matrix B;; of B is the (n — 1) x 
(n — 1) matrix obtained by removing the ith row and the jth column from B. 
For example, if 


1 2 3 
B=|-1 —2 -3), 
4 5 6 
then 
2 3 
m=(2 3. 


Minor matrices can be used to define an operation on square real matrices (the 
determinant) which makes invertible matrices easy to identify (see Theorem C.6 
below). 

The determinant can be defined recursively as follows. Let B be ann x n real 
matrix. 


i) Ifn = 1, then the determinant of B is defined by det[b] = b. 
ii) Ifn = 2, then the determinant of B is defined by 


det i ‘| —ad — be. 


iii) If n > 2, then the determinant of B is defined recursively by 
det[DijInxn = bi1 det By — biz det Byy +--+ (—1)""' din det Bin, 


where B,; are minor matrices of B. 
The following result shows what an elementary column operation does to the 
determinant of a matrix. 


C.5 Theorem. Let B = [bj] and C = [cjj] ben x n real matrices, n = 2. 
i) If C is obtained from B by interchanging two columns, then det C = — det B. 
ii) If C is obtained from B by multiplying one column of B by a scalar a, then 
det C = a det B. 
iii) If C is obtained from B by multiplying one column of B by a scalar and 
adding it to another column of B, then det C = det B. 


Proof. Since 


det |¢ ‘| = ad — be = ~(be ~ ad) = det 7 A 
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part i) holds for 2 x 2 matrices. Suppose that part i) holds for (nm — 1) x (n — 1) 
matrices. Suppose further that there are indices jg < j, such that bjj, = 
cij, and jj, = cj fori = 1,...,n. By the inductive hypothesis, detC,; = 
— det B); for j # jo and j F ji, 


det Cj jy = (—1)/1~! det By;,, and det Cy j, = (—1)~1*" det By jp. 
Hence, by definition, 


det C = cj; det Cy, — cy2 detCy2 +--+ + Gly as det Cin 
= —by, det By; + biz det Biz + --- — (—1)""! Diy det Bin = — det B. 


Thus i) holds for all € N. Similar arguments establish parts ii) and iii). 


In the same way we can show that Theorem C.5 holds if column is replaced 
by row. It follows that we can compute the determinant of a real matrix by 
expanding along any row or any column, with an appropriate adjustment of 
signs. For example, to expand along the ith row, interchange the ith row with 
the first row, expand along the new first row, and use Theorem C.5 to relate 
everything back to B. In particular, we see that 


det[bijInxn = (—1)'t'bj1 det Bj) + (—1)'**bj2 det Byr +--+ (-1)' Din det Bin. 


The numbers (—1)'*/ det B; j are called the cofactors of b;; in det B 

The operations in Theorem C.5 are called elementary column operations. 
They can be simulated by matrix multiplication. Indeed, an elementary matrix is 
a matrix obtained from the identity matrix by a single elementary column oper- 
ation. Thus elementary matrices fall into three categories: E(i = /), the matrix 
obtained by interchanging the ith and jth columns of J; E(ai), the matrix ob- 
tained by multiplying the ith column of J by a 4 0; and E(qai + /), the matrix 
obtained by multiplying the ith column of J by a ¥ 0 and adding it to the jth 
column. Notice that an elementary column operation on B can be obtained by 
multiplying B by an elementary matrix; for example, E(i < j)B is the matrix 
obtained by interchanging the ith and jth columns of B. 

These observations can be used to show that the determinant is multiplicative. 


C.6 Theorem. /f B,C aren x n real matrices, then 
det(BC) = det B det C. 
Moreover, B is invertible if and only if det(B) 4 0. 
Proof. It is easy to check that 


det(E(i @ j))=—1, det(E(ai))=a, and det(E(ai + j)) =1. 
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Hence, by Theorem C.5, 
det(E A) = det E det A (7) 


holds for any n x n matrix A and any n x n elementary matrix E. 

The matrix B can be reduced, by a sequence of elementary column opera- 
tions, to a matrix V, where V = / if B is invertible, and V has at least one zero 
column if B is not invertible (see Noble and Daniel [9], p85). It follows that 


there exist elementary matrices F),..., £, such that A = E,... E)V. Hence, 
by (7), 
det(B) = det(E,... EpV) 
= det(£)) det(E2...E,V) =--- = det(E,... E,)det(V). 


In particular, B is invertible if and only if det B £4 0. 
Suppose that B is invertible. Then V = J and by (7), 


det(BC) = det(E) ... Ep)det(VC) = det B det C. 


If B is not invertible, then BC is not invertible either (see Noble and Daniel 
[9], p204). Hence, det(BC) = 0 and we have 


det(BC) = 0 = det B det C. | 


The transpose of a matrix B = [b;;] is the matrix B’ obtained from B by 
making the ith row of B the ith column of B’; that is, the (i x j)th entry of B” is 
b;;. The adjoint of ann x n matrix B is the transpose of the matrix of cofactors 
of B; that is, 

adj(B) = [(—1)'*/ det B;;]’. 


The adjoint can be used to give an explicit formula for the inverse of an in- 
vertible matrix. 


C.7 Theorem. Suppose that B is a square real matrix. If B is invertible, then 


1 
B= dj(B). 8 
Tet Bo ) (8) 


Proof. Set [cj;] = B adj (B). By definition, 

cig = (— 1) by By He + (HD bin Bin. 
If i = j, then c;; is an expansion of the determinant of B along the ith row of 
B (ie., ci = det B). Ifi A j, then c;j is a determinant of a matrix with two 


identical rows so c;; is zero. It follows that 


B adj(B) = det B - I. 
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We conclude by Theorem C.4 that (8) holds. a 


Pl ge el ©) 


The following result shows how the determinant can be used to solve systems 
of linear equations. (This result is of great theoretical interest but of little practi- 
cal use because it requires lots of storage to use on a computer. Most packaged 
routines which solve systems of linear equations use methods more efficient than 
Cramer’s Rule; e.g., Gaussian elimination.) 


In particular, 


C.8 Theorem. [CRAMER’S RULE]. 


Let cy, c2,...,€n € Rand B = [bjj|nxn be a square real matrix. The system 
byx1 + Dyaxq + +++ + Dinkn = C1 (10) 
b21x1 + by2x2 +--+ + banxn = €2 


bnix1 + bnax2 + +++ + BanXn = Cn 


of n linear equations in n unknowns has a unique solution if and only if the 
matrix B has a nonzero determinant, in which case 


det C(j) 
xj= , 
det B 


where C(j) is obtained from B by replacing the jth column of B by the column 
matrix [c, ...¢n]’. In particular, if c; = 0 for all j and det B ¥ 0, then the 
system (10) has only the trivial solution x; = 0 for j = 1,2,...,n. 


Proof. The system (10) is equivalent to the matrix equation 
BX =C, 


where B = [b;j], X = [x1...x,]’, and C = [c)...cy]’. If det B # 0, then by 
Theorem C.7, 


X=B'!c= 


1 
dj(B)C. 
det B adj(B) 
By definition, adj(B)C is a column matrix whose jth “row" is the number 


(-1)'t cy det By; + (-1)?*"cp det Boj +++» + (-1)"14 cn det By; = det C(j). 


[We expanded the determinant of C(j) along the jth column.] Thus x; = 
det C(j). 

Conversely, if BX = C has a unique solution, B can be row reduced to /. 
Thus B is invertible (i.e., det B 4 0). a 
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D. QUADRIC SURFACES 


A quadric surface is a surface which is the graph of a relation in R? of the form 


Ax? + By? +Cz*+ Dx +Ey+Fz+Gxy + Hyz+Izx = J, 


where A, B,..., J € Rand not all A, B,C, G, H, I are zero. We shall only con- 
sider the cases when G = H = | = 0. These include the following special types. 


1. The ellipsoid, the graph of 
Ax? + By? + C2? =1, 


where A, B, C are all positive 
2. The hyperboloid of one sheet, the graph of 


Ax? + By? + C2? =1, 


where two of A, B, C are positive and the other is negative. 
3. The hyperboloid of two sheets, the graph of 


Ax? + By? + C2? =1, 


where two of A, B, C are negative and the other is positive 
4. The cone, the graph of 


Ax? + By* + Cz? =0, 


where two of A, B, C are positive and the other is negative 
5. The paraboloid, the graph of 


z= Ax? + By’, 


where A, B are both positive or both negative 
6. The hyperbolic paraboloid, the graph of 


z= Ax? + By’, 


where one of A, B is positive and the other is negative 
The trace of a surface S in a plane J7 is defined to be the intersection of S 
with /7. Graphs of many surfaces, including all quadrics, can be visualized by 
looking at their traces in various planes. We illustrate this technique with a 
typical example of each type of quadric. 


D.1 EXAMPLE. 
The ellipsoid 3x? + y* + 2z* = 6. 
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AZ 


FIGURE D.1 


SOLUTION. The trace of this surface in the xy-plane is the ellipse 3x7 + y? = 6. 
The trace of this surface in the yz-plane is the ellipse y? + 2z2 = 6. And the 
trace of this surface in the xz-plane is the ellipse 3x7 + 2z7 = 6. This surface is 
sketched in Figure D.1. | 
D.2, EXAMPLE. 


The hyperboloid of one sheet x” + y? — z? = 1. 


Az 


FIGURE D.2 


SoLuTION. The trace of this surface in the plane z = a is the circle x7 + y? = 
1+’. The trace of this surface in x = 0 is the hyperbola y* — z? = 1. This 
surface is sketched in Figure D.2. | 
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D.3 EXAMPLE. 
The hyperboloid of two sheets x? — y? — z? = 1. 


FIGURE D.3 


SOLUTION. The trace of this surface in the plane z = 0 is the hyperbola 
x? — y? =1. The trace of this surface in y = 0 is the hyperbola x? — z* = 1. 
This surface has no trace in x = 0. This surface is sketched in Figure D.3. a 


D.4 EXAMPLE. 


The cone 2? = x? + y?. 


<Y 


FIGURE D.4 
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SOLUTION. The trace of this surface in the plane z = a is the circle x7+y* = a’. 
The trace of this surface in y = 0 is a pair of lines z = +x. This surface is 
sketched in Figure D.4. | 


D.5 EXAMPLE. 


The paraboloid z = x? + y?. 

SOLUTION. If a > 0, the trace of this surface in the plane z = a is the circle 
x? 4+ y? =a. The trace of this surface in y = 0 is the parabola z = x”. This 
surface is sketched in Figure D.5. a 


FIGURE D.5 


FIGURE D.6 
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D.6 EXAMPLE. 


The hyperbolic paraboloid z = x? — y?. 

SoLuTION. The trace of this surface in the plane z = a is the hyperbola 
a = x* — y*, (It opens up around the xz-plane when a > 0, and around the yz- 
plane when a < 0.) The trace of this surface in the plane y = 0 is the parabola 
z = x’. This surface is sketched in Figure D.6. (Note: The scale along the x-axis 
has been exaggerated to enhance perspective, so the hyperbolas below the z = 0 


plane are barely discernible.) a 


E. VECTOR CALCULUS AND PHYSICS 
Throughout this appendix C = (g, J) is a smooth arc in R’, S = (W,E)isa 


smooth surface in R*, {to,..., ty} is a partition of J, and {R,..., Ry} is a grid 
on E. 
Ri y (Rj) 
Av ae Av: YW, 
(uy) AM Au: Yi 
FIGURE E.1 


E.1 Remark. The integral 


II do -| |Ny-(u, v)|| du, v) (11) 
S E 


can be interpreted as the surface area of S. 

Let (u;,v;) be the lower left-hand corner of R; and suppose that R; has 
sides Au, Av (see Figure E.1). If R; is small enough, the trace of each piece 
S; = (w, Rj) is approximately equal to the parallelogram determined by the vec- 
tors Au W, and Av wy. Hence, by Exercise 8.2.7, 


A (Sj) © [|(Au wy (uj, vj) x (Av Wj, vj))II 
= ||Ny@j, vj) || AuAv = [Ny (uj, vj) Ril. 


Summing over j, we obtain 
N 
A(S) © YO INy Uj, vpiRil, 
j=l 


which is a Riemann sum of the integral (10). 
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E.2 Remark. Jf w is a thin wire lying along C whose density (mass per unit 
length) at a point (x, y) is given by g(x, y), then 


[sas 
Cc 


can be interpreted as the mass of w. 
Since mass is the product of density and length, an approximation to the mass 
of the piece of w lying along Cy = (@, [t-1, t,]) is given by 
tk 
S(t) LIC) = / g(te) IP’ (o) || dt 


tk-1 


(see Definition 13.9). Summing over k, an approximation to the mass of w is 


Noahs 
> / g(t ilo' (0) || dt, 
j=l tk-1 


which is nearly a Riemann sum of the integral 


[sooo a= | gds. 
I C 


The following remark has a similar justification. 


E.3 Remark. J/f S is a thin sheet of metal whose density at a point (x, y, Zz) is 
given by g(x, y, z), then 
i i, gdo 
Ss 
can be interpreted as the mass of S. 

Work done by a force F acting on an object as it moves a distance d is 
defined to be W = Fd. There are many situations where the force changes 
from point to point. Examples include the force of gravity (which is weaker 
at higher altitudes), the velocity of a fluid flowing through a constricted tube 
(which gets faster at places where the tube narrows), the force on an electron 


moving through an electric field, and the force on a copper coil moving through 
a magnetic field. 


E.4 Remark. [fan object acted on by a force F : R* + Rmoves along the curve 
C = (@, I), then the unoriented line integral 


[Fa 
C 


can be interpreted as the work done by F along C. 
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An approximation to the work done along Cx = (@, [t—-1, t]) is 


tk 


Wie © FOGG) — 6(-DIl = / 


th 


F (b(t) IOI at. 


Summing over k, we find that an approximation to the total work along C is 
given by 
tk 


N 
Yop Fe) Ie Olde, 
k=1 


th-1 


which is nearly a Riemann sum of the integral 


[ Feoe'ol a= | Fads. 
i a 


The following remark explains why F - T is called the tangential component 
of F and F - nis called the normal component of F. 


E.5 Remark. Let u be a unit vector in R* (respectively, R°) and F be a function 
whose range is a subset of R? (respectively, R*). If ¢ is the line in the direction u 
passing through the origin, then |F - ul is the length of the projection of F onto € 
(see Figure E.2a). 


FIGURE E.2a 


Let 6 represent the angle between u and F. By (3) in Section 8.1, 
|F- ul = cos @|| F'| |lul| = cos @|| Fl]. 


Hence, by trigonometry, | F'- ul is the length of the projection of F onto £. Notice 
that F - wis positive when @ is acute and negative when @ is obtuse. 
Combining Remarks E.4 and E.5, we see that f., F - T ds represents the work 


done by the tangential component of a force field F : R*? —> R? along C. 


E.6 Remark. /f S = (yw, E) is a thin membrane submerged in an incompressible 
fluid which passes through S, and F(x, y, z) represents the velocity vector of the 
flow of that fluid at the point (x, y, z), then the oriented integral of F -n can be 
interpreted as the volume of fluid flowing through S in unit time. 
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Let {E;} be a grid which covers E, and let h be the length of the line segment 
obtained by projecting F onto the normal line to S at a point (x;, yj, zj) € W(E;) 
(see Figure E.2b). If E; is so small that F is essentially constant on the trace of 
S; = (W, E;), then an approximation to the volume of fluid passing through S$; 
per unit time is given by 


V; =A(S))-h =A (Sj) F(x, yj. Zj) 0 
=A(S/)F(WU;, 0j)) > Ny @;, vp /INv@;, vp. 


w (Ej) 


FIGURE E.2b 


Summing over j and replacing A (Sj) by ||Ny|| |E;| (see Remark E.1 above), we 
see that an approximation to the volume V of fluid passing through S per unit 
time is given by 


N 
Yo Fu; vj) Ny uj, vp lEjl. 


j=l 


This is a Riemann sum of the oriented integral 
[[ Fova.on nya. naa = ff Fade. 
E s 


F. EQUIVALENCE RELATIONS 


A partition of a set X is a family of nonempty sets {Eq}qe4 such that 


x= |) Ey and Ey NE, =G 


acA 


fora # B. A binary relation ~ on X is a subset of X x X. If (x, y) belongs to ~, 
we shall write x ~ y. Examples of binary relations include = on R, < on R, and 
“parallel to" on the class of straight lines in R?. 

A binary relation is called an equivalence relation if it satisfies three additional 
properties. 
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[THE REFLEXIVE PROPERTY] For every x € X,x ~ x. 
[THE SYMMETRIC Property] If x ~ y, then y ~ x. 
[THE TRANSITIVE Property] If x ~ y and y ~ z, then x ~ z. 


Notice that = is an equivalence relation on R, “parallel to" is an equivalence 
relation on the class of straight lines in R?, but < is not an equivalence relation 
on R (it fails to satisfy the Symmetric Property). 

If ~ is an equivalence relation on a set X, then 


xi={yeX:y~x} 
is called the equivalence class of X which contains x. 


Fl Theorem. [f ~ is an equivalence relation on a set X, then the set of equiva- 
lence classes {x : x € X} forms a partition of X. 


Proof. Since ~ is reflexive, each equivalence class ¥ contains x (i.e., X is 
nonempty). Suppose that +N y # & (i.e., that some z € X belongs to both 
these equivalence classes). Then z ~ x and z ~ y. By the Symmetric Property 
and the Transitive Property, we have x ~ y (i.e., y € x). By the Transitive 
Property, it follows that y C x. Reversing the roles of x and y, we also have 
x Cy. Thus x = y. a 
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Answers and Hints to Selected 


Exercises 
CHAPTER 1 
1.2 Ordered Field Axioms 
0. b) False. d) True. It’s vacuous. 
1. b) Consider the cases c = Oandc £0. 
2. To prove (7), multiply the first inequality in (7) by c and the second 
inequality in (7) by b. Prove (8) and (9) by contradiction. 
4. a) (—5, 6). b) (—3, 4). c) (1/2, 00). d)(—00, 2) U (4, 00). e) (1, 1). 
5. a) Apply (6) to a — 1. b) Apply (6) to a — 2. c) Apply (6) to 1 — a. 
d) Apply (6) to a — 2. 
6. Observe that (/a — Vb) > 0. 
7. Factor. Then use Remark 1.5 and/or the triangle inequality. 
8. a)n > 25.b)n EN. c)n > 32. 
9. a) Use uniqueness of multiplicative inverses to prove that (nq)~! = 
n—'q~!. b) Use part a). c) Use proof by contradiction for the sum. 
Use a similar argument for the product, and identify all rationals g such 
that xq € Q for a given x € R\ Q. d) Use the Multiplicative Properties. 
10. Show first that the given inequality is equivalent to 2abcd < b*c? +a*d?. 
11. a) The Trichotomy Property implies i); the Additive and Multiplicative 


Properties imply ii). 


1.3 The Completeness Axiom 


COD NW 


9. 
10. 


. b) True. d) False. 
. a) inf E = —5, sup E = 2. b) inf E = 10/3, sup E = 10. c) inf E = V2, 


supE = 2. d)infE = —1, supE = 1/2. e) infE = 0, supE = 1. 
f) inf E = 4, sup E = 6. 


. Notice that a — /2 < b — V2, and use Exercise 1.2.9c. 

. See Theorem 1.15. 

. b) Apply Theorem 1.14 to —E. 

. After showing that sup A and sup B exist, prove that max{sup A, sup B} < 


sup E. 
Use the proof of Theorem 1.18 as a model. 
th<b<.... 


1.4 Mathematical Induction 


0. 
Li, 


c) True. 
Compare with Exercise 1.2.5. 
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3. c) You may use part a). 
8. Use the Binomial Formula. 
9. b) Show that n* + 3n cannot be the square of an integer when n > 1. 
c) The expression is rational if and only if n = 9. 
10. This recursion, discovered by P. W. Wade, generates all Pythagorean 
triples a, b, c which satisfy c — b = 1. 


1.5 Inverse Functions and Images 


0. a) False. It’s 7 — arcsin(x). d) False. 

1.i) f-l'@) = @-4/2. ii) f-'@ = —-I/logx. iii) f-'@) = 
arctan (x/2). iv). f~! (x) = 2—Jx +3. v) fo! (x) = (lxl—|x + 3/4:3x)/3. 
vi) f~' ) = (1— JT — 4x (&« — D) /2x when x 40, and f~! (0) = 1. 

2. a) f(E) = (5,25), f-'(E) = (4/10,7/10). b) f(E) = [-2,2], 
f'@® = [-2,-1)U G,2]. ¢) f() = [0,4], f '®) = [0.4]. d) 
f (E) = [0,log (10)], f-! (E) = [2 - ve>=1,2) U (2,24 VeF=1]. 


e) f (E) =[-1, 1], f! (E) = Uy, <7 [2ka, (2k + 1) x]. 
. a) [—1, 2]. b) {0}. c) [0, 1). d) [1, 2]. e) {1}. f) (—ow, 00). 
4. First prove that f(A)\ f(B) C f(A\B) and A C f'(f(A)) hold whether 
f is 1-1 or not. 


1.6 Countable and Uncountable Sets 


0. b) False. d) False. 
4. If dis 1-1 from a set E into A, is W(x) := f(¢(x)) 1-1 from E into B? 
6. a) Prove it by induction on n. b) Use Exercise 1.6.5. 


Go 


CHAPTER 2 
2.1 Limits of Sequences 


0. b) False. d) False. 

2. a) Apply Definition 2.1 with ¢/4 in place of e. b) First prove that 1/x, < 2 
for n large. 

5. Definition 2.1 works for any positive e including ¢/C when C > 0. 


2.2 Limit Theorems 


0. b) True. c) False. d) You may wish to use derivatives to show that 2* — x 
is increasing on [2, oo). 

1. c) You may use Exercise 2.2.5. d) First prove that n? < 3” forn=1, 
2,853 

. a) —5/6. b) 1. c) oo. d) 4. 

. You may wish to prove that /%, — J/x = (% — x)/(/%n + VX). 

. Use Theorem 1.18. 

. If x = limpsoo Xp exists, what is limp Xn41? 


CON NW 


2.3 The Bolzano-—Weierstrass Theorem 


0. b) You only need to prove that {x,} has a convergent subsequence, not 
actually find it. d) False. 
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4. Prove that {x,} is monotone. 
8. a) See Exercise 1.2.6. 


2.4 Cauchy Sequences 


0. a) False. c) True. 

1. You may use Theorem 2.29. 

5. Use Exercise 2.4.4. 

6. Is it Cauchy? (Compare with Example 2.30 and see Exercise 1.4.4c.) 
8. a) Use the Bolzano—Weierstrass Theorem. 


2.5 Limits Supremum and Infimum 


1. a) 0,2. b) —1, 1. c) 0,0. d) 1, 1. e) 0, 0. f) —o0, 00. g) 00, co. 
4. a) First prove that infys, x, +infesn ye < infesn (xe + yx). C) By b), the first 
and final inequalities can only be strict if neither {x,} nor {y,} converges. 


7. Let s = inf,en(supys, xx) and consider the cases s = oo, s = —oo, and 
seER. 
8. Let s = liminf,-... x, and consider the cases s = oo, s = 0, andO < 
s< OO. 
CHAPTER 3 


3.1 Two-Sided Limits 


0. b) False. c) False. 
1. See Example 3.3. 
2. b) The limit is zero. c) Does it exist as an extended real number? Why 
not? 
. a) —2. b) n. c) 0. d) 0. [Recall that sin 2x = 2 sin x cos x.] e) 0. 
. b) Use Exercise 3.1.6. 
. b) Use Exercise 3.1.6. 


3 
7 
8 
3.2 One-Sided Limits and Limits at Infinity 
0 
1 


. a) False. c) True. 
. d) You may wish to multiply by —1/ — 1 to make the numerator positive 
asx > IT, 

2. a) 13/6. b) 3/2. c) 1. d) —oo. e) —o0. f) V2/2. 

3. b) Use Theorem 3.8. 

5. Prove that if f(x) does not converge to L as x — ov, then there is a 
sequence {x,} such that x, — oo but f(x,) does not converge to L as 
n> oOo. 

6. See Exercise 2.2.6. 


3.3 Continuity 
0. a) True. Show that J = [a, 8], where a = inf{f(x) : x € [a,b]} and 
B = sup{ f(x) : x € [a, b]}. c) True. 
1. a) and d) Recall that ./x is continuous on [0, 00). 
2. c) Recall that 2* = e* 982, 


664 Answers and Hints to Selected Exercises 


8. 


ay. 


b) Use part a) to show that f(x) = f(mx/m) = mf (x/m) first. d) If the 
statement is true, then m must equal f (1). 
a) Begin by showing that f(0) = 1. 


3.4 Uniform Continuity 


0. 
1. 
5: 


\o © 


d) True. (See Exercise 3.4.5.) 

c) Recall that sin 2x — sin 2a = 2 sin(x — a) cos(x +a) and that | sin6| < |6]. 
c) and e) Prove that f(x) = x and g(x) = x? are both uniformly continu- 
ous on (0, 1) but only one of them is uniformly continuous on [0, 00). 


. a) This is a function analogue of the Monotone Convergence Theorem. 
. You may wish to prove that if P(x) = a,x" +---+ ao is a polynomial of 


degree n > 1 whose leading coefficient satisfies a, > 0, then P(x) > oo 
asx > O. 


CHAPTER 4 
4.1 The Derivative 


0. 
4. 
5. 


a) False. d) True. 
Use Definition 4.1. 
a) a =ka and b =a —2(-1)* fork € Z. b) (—1, 7) and (1, 3). 


4.2 Differentiability Theorems 


OANrR OO 


. b) True. 

. a) b+ 2d. b) —a/5. c) ac. d) bd. 

. a) 6+ 54m. b) 2e/3. c) (2 —e)/12. 

. No, f is not differentiable at 0. 

. a) Use assumptions ii) and vi) to prove that sinx — 0 asx — 0. Use 


assumption iii) to prove that cosx — 1 asx — 0. b) First prove that 
sinx = sin(x — x9) cos x9 +cos(x — x9) sin xq for any x, x9 € R. c) Assump- 
tion vi) and 0 < 1—cosx < 1 —cos* x play a prominent role here. d) Use 
assumption iv) and part c). 


4.3 The Mean Value Theorem 


0. a) True. (But you cannot use Theorem 4.17i.) b) False. 
10. a) Compare with Exercise 4.1.8. 
11. This exercise has nothing to do with the Mean Value Theorem. 
12. Use Darboux’s Theorem and Theorem 4.18. 


4.4 Taylor’s Formula and l’H6pital’s Rule 


0 
1 
2. 
5 


. a) False. 
.c)n=5, 
c) n = 1999. 


. a) —3. b) 2/log2. c) 1. d) 1. e) —1. f) 1. g) —8. h) 1 (Rationalize 


the numerator and denominator. L’Hopital’s Rule only makes matters 
worse.) 


6. 


7. 
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a) Use one-variable calculus techniques to maximize f(x) = log x/x® for 
x €[1, oo). 
b) First prove that if g(x) = eH? jk for some k € N, then g(x) > 0 as 
x — 0. Next, prove that given n € N, there are integers N = N(n) € N 
and ay = ae € Zsuch that 

pg) = |Lteo/ee x £0 
0 x =0. 


(Note: Although for each n € N many of the a,;’s are zero, this fact is not 
needed in this exercise.) 


4.5 Inverse Function Theorems 


0. a) False. c) False. 
1. a) 1/3. b) —1/4. c) —5/6. 
2. b) 1/6e. 
3. Observe that if x = sin y, then cos y = V1 — x?. 
9. f(x) =+/ax +c for somec €R. 
11. Use Darboux’s Theorem. 
CHAPTER 5 


5.1 The Riemann Integral 


0 
1 
4. 
5 
8 
9 


. a) False. 

. a) L(f, P) = 27/8. b) U (f, P) = 133/8. c) L(f, P) © 0.0054542. 

a) Look at the proof of Lemma 3.28. 

. First show that f , f (x) dx = 0 for all subintervals / “ [a, b]. 

. Notice that |x; — xj- 1| < ||P || for each j = 1, 2,. 

. Prove that there is an absolute constant C > 0 a that Mj(./f) — 


mj(/f) < C(Mj(f) — mj(f)). 


5.2 Riemann Sums 


CONN 


. b) True. c) True. 

. a) 13/2. b) 15. c) 97/2. d) 24 + 97/2. 

. a) See Exercise 4.4.4 and use three nonzero terms for the upper bound. 
b) Use the Taylor polynomial with four nonzero terms. 

. Do not forget that f is bounded. 

. b) You may use the fact that fx” dx =x"t!/(n +1). 

. a) If | f(xo)| > M —e/2 for some xo € [a, b], can you choose a nondegen- 
erate interval J such that | f(x)| > M —e for all x € 1? b) See Example 
2.21. 


5.3 The Fundamental Theorem of Calculus 


0. b) True. c) True. 
1. a) —f (x). b) 12x73 f (3x*) —2xf (x). c) x sinxf (x sinx) (sinx + x cos x). 
d) 2f (2x). 
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3. a) —1/2. b) —4. 

6. Use the Fundamental Theorem of Calculus. 

1. The hard part is showing that f% is integrable on [a,b]. You may use 
Exercise 4.2.7a. 

12. First show that log(a,) is a Riemann sum. 


1 


5.4 Improper Riemann Integration 


0. c) True. 

1. a) 1. b) 1/8. c) 4. d) —1/(@ +4 1)’. 

2.a)p>1.b)p<l.c)p>1.d)p>lej)p>l. 

3. Compare with Example 5.44. 

4. a) Diverges. b) Diverges. c) Converges. d) Converges. e) Converges. 

9. Integrate by parts first. 

0. You might begin by verifying sinx > /2/2 for x € [/4, 1/2] and sinx > 
2x/m for x € [0, 7/4]. 


5.5 Functions of Bounded Variation 


1. c) Find a partition where the jumps are large. 
4. Combine Theorems 4.18 and 3.39. 
9. For the bounded case, prove that (L) / sa | f'(x)|dx < Var f < 


(U) f? |f'@ldx. 
5.6 Convex Functions 
5. Use Remark 5.60. 


CHAPTER 6 
6.1 Introduction 


. a) False. d) True. 

. a) 2. b) —4/5. c) 125/6. d) 3/5. 

. a) —2. b) —3 c) log(3/4). d) 2 sin6. 

|x] <1. 

. b) Consider the Geometric Series. 

. a) Is nary, < po, ak? 

. c) Notice that if the partial sums of )°?° ; by are bounded, then b = 0. 

. b) See Exercise 6.1.9b. d) First prove that if a, > 0 and 77°, ax diverges, 
then >of) ak = 00. 

11. Use Corollary 6.9. 


6.2 Series with Nonnegative Terms 


0. b) True. c) True. 

1. c) If p > 1, are there constants C > 0 and q > 1 such that logk/k? < 
Ck-4? 

2. b) No, you cannot apply the p-Series Test to 1/k!~!/* because the expo- 
nent p := (1—1/k) is NOT constant but depends on k. d) Try the Integral 
Test. 


SODDNNFR © 


4. 
8. 
10. 
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It converges when p > | and diverges when 0 < p < 1. 
It diverges when 0 < g < 1 and converges when g > 1. 
Recall that a? := e?!°8¢ and observe that log(log(logk)) + co ask > oo. 


6.3 Absolute Convergence 


6.4 


6.5 


6.6 


0. 
2: 


3. 


9. 
10. 


b) False. c) False. 

a) Convergent. b) Convergent. c) Convergent. d) Convergent. e) Con- 
vergent. f) Convergent. g) Divergent. 

a) (1, 00). b) J. c) (—00, —1) U (1, 00). d) (1/2, 00). e) (1, 00). 

Dee, log (e)) (use Stirling’s Formula when p = log,(e)). 

m~/8. 

See Definition 2.32. 


Alternating Series 


8. 
9. 


. a) True. c) False. 
. b) See Example 6.34. 


. a) [-1, 1). b) (-V3, 73). c) (-1, 1. d) [1, 2]. 
. a) Absolutely convergent. b) Absolutely convergent. c) Absolutely con- 


vergent. d) Conditionally convergent. e) Absolutely convergent. 


. Is it Cauchy? 
. Letce = ek a;b; and apply Abel’s Formula to 


m m 
So ax — Soc = Ce+1/Dr. 


k=n k=n 


See Example 6.34. 
See Example 6.34. 


Estimation of Series 


1. 


2: 
3. 


a) At most 101 terms. b) At most 8 terms. c) At most 3 terms. (To prove 
that {a;} is monotone, verify that ag41/az < 1.) 

a)p>l. 

a)n=6.b)n =31.c)n = 38.d)n=5. 


Additional Tests 


ds 
2. 


a) Divergent. b) Absolutely convergent. c) Divergent. d) Divergent. 

a) Absolutely convergent for p < 0 and divergent for p > 0. b) Abso- 
lutely convergent for p < 0 and divergent for p > 0. c) Absolutely con- 
vergent for |p| > e, conditionally convergent for p = —e, and divergent 
otherwise. (Use Stirling’s Formula when p = +e.) 


. It actually converges absolutely. 
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CHAPTER 7 


7.1 Uniform Convergence of Sequences 


2; 
6. 


8. 


9. 


a) (47! — 1) /21. b) 2. c) 7.014. 

This is different from Theorem 7.9 because when E is not compact, uni- 
form continuity is not the same as continuity. 

Modify the proof of Example 4.22 to show that (1+x/n)” + e* asn — oo. 
To prove that this is a uniform limit, choose N so large that [a,b] C 
[—N, N] and find the maximum of e* — (1 + x/N)% on [a, b]. 

a) Use Exercise 7.1.5c. 


7.2 Uniform Convergence of Series 


. a) Recall that |cos@ — 1| < |@| for all@ ER. 

. Dominate, then telescope. 

. Is there a connection between )-7°, k~! sin(xk~!) and S°?2, cos(xk7!)? 
. Use Abel’s Formula. 

. See Example 6.34. 


7.3 Power Series 


1. 
2: 


7. 


9. 
10. 
11. 


a) R=1.b)R=1/2.c)R=1/5.d)R=1. 

a) (—25, 25). b) (5/6, 7/6). c) [—2, 0). d) [-V2, V2]. (Use Raabe’s Test 
for the endpoints.) 

a) f (x) = 2x4/(1—x°) for x € (-1, 1). b) f (~) = (6 —3x)/(L—x)? 
forx € (1,1). c) f@) = 5(logx + $-1)/-2) for x € (0,2), 
x # 1,and f(1) = 0. d) f(x) = log(1/(1 — x°))/x° for x € [—-1, 1), 
x £0, and f(0) = 1. 


. Use Exercise 2.5.8 to prove that if lim sup |ag/ag41| < R, then there is 


anr < R such that {|axr*|} is increasing for k large (i.e., that yi a,r* 


diverges for some r < R). 

Use the method of Example 7.36 to estimate | f’(x)|. 

First prove that the radius of convergence of )772. axx* is > 1. 

a) Use Theorem 6.35 to estimate log(n!) = )-7_,logk. b) x € 


(—1/e, 1/e). 


7.4 Analytic Functions 


1. 


a) sin (3x) — 3x + 9x3/2 = WP, (-1)¥(3x)7**! / (2k + DI. 
b) x75" = Yo 4 x" lop? 5) = 3)! 

c)1—2sin? x = 17, (—4)* (x)* / (2k)!. 

d) (e?* — 1) /2x = Wo 2kxk/ (K+ DL 


Pajay (1 + x3) => 4 epee. 


b) e*/ +4) = Deo (Dhue H) x4 

c) log (1/ |x? — 2x + 1]) = Spo 2x*/k. 

d) arccosx = )°7° 5 ea (1 (x2k*! — 1) / (2k + 1). (Use Theo- 
rem 7.52.) 


. a) 2e* = VP 2e (x — 1)* / ki, valid for x € R. 


b) log; (x4) = 4 72, (—D**! (& — D* / (k log 3), valid for x € (0, 2]. 
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c) x3 —x742=24 (4-1) 4+2(¢—-1)?+ (4 — 1), valid for x ER. 
d) Use Raabe’s Test. x =14+(x-—1)/24+ 07%, (-D*! 1-3--- 
(2k — 3)) / (2-4--- (2k)) (x — 1), valid on [0, 2]. 

. Let f, be a Taylor polynomial of f and compute the integral of f - fh. 
Question: If f, — f uniformly and f is bounded, does f - fy > f? 
uniformly? 


10. See Exercise 5.1.4 and use analytic continuation. 
11. First use the Binomial Series to verify that (1 + x)? > 1+ x for any 


O<x <1. 


7.5 Applications 


1 


The first seven places of the only real root are given by —0.2580558. 


6. Choose rp as in the proof of Theorem 7.58, define {x,} by (19), and find a 


5 so that | f(xo)| < 4 implies |x — xn—1| < rg*- 


CHAPTER 8 
8.1 Algebraic Structure 


4. 
6. 
7. 
8. 


Use (2). 
b) (a, 2a, 2a), a # 0. ¢) (a, (5a — 16) /2, (32 — 7a) /2),a £0. 
arccos(1/,/n). 


b) For part vi), write |x x y||? = (x x y) - (x x y) and use parts iv) and v). 


8.2 Planes and Linear Transformations 


2. 


a)x+y—2z+3w = 1.b) 2x —2y —2z+w = 1.c) Use Exercise 1.4.4a. 
1 oO 1 0 
a) 1 -1 0 0 
0 oO 00 
—1 1 01 
b) [1 1 -1] 
c) 10... 0 O 
10... 0 -1 {° 


5. Use the linear property to compute the T (e;)’s. 


-1 1 
a) A= 1 3 
xz 0 


een ee | 
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\4=p i 3 | 


for any choice of a,b ER. 


a l-a 5 —2 
d) A=|b 2-—b 3-1 
c 7-c¢ 1 1 


for any choice of a,b ER. 

. Don’t be an algebra zombie. Use a, b, c to produce a normal to /7. 

. If (xo, yo, Zo) does not lie on J7, let (x2, y2, z2) be a point on J7 different 
from (x1, y1, Z1), let 6 represent the angle between w := (xo — x2, yo — 
y2,Z9 — Z2) and the normal (a,b,c), and compute cosé@ two different 
ways, once in terms of w and a second time in terms of the distance from 
(xo, yo, Zo) to IT. 


CO ON 


ge 
10. a) eel 


CHAPTER 9 
9.1 Topology of R” 


1. a) Open but not connected. b) Closed and connected. c) Neither open 
nor closed but connected. d) Open but not connected. e) Closed and 
connected. 

6. b) Try a proof by contradiction. 

7. b) Try a proof by contradiction. 

9. Notice that a e E° and E* is open. 

10. Diamonds and squares. 


9.2 Interior, Closure, and Boundary 

1. a) E°=9,E = EU{0}, dE = E.b) E° = E, E =[-1, 1], E = {-1, lh. 
c) E° =R, E=R, dE = @. d) Use Theorem 1.18. 

2. a) E° = {(x, y) tg? 2 0y7 = 1}, E=EanddE = {(x, y) 4x? + Oy? 
=1},. bo =o; F =F and b= 2. 0) B= fon) sy = 205 
O<y<2},E ={(,y) :y>2x?, 0<y <2}, and dE ={(x,y): y= 
2x7,0< y <2}U{(x,2) : -l<x<1}.d) £°=E, E = {x?-2y? 21, 
_l<y<1}, and dE = {x?-2y? =1,-1sysl}u{@,-) ean 
<x<v3lulo,1) V3 <x < v3h. 


10. c) Use part b). You may assume that R” is connected. 
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9.3 Compact Sets 


1. a) Compact. b) Compact. c) Not compact. H = EU{(0,y) :-l1 <y < 1}. 
d) Not compact. There is no compact set H which contains E. 
5. Compare with Exercise 9.1.8. 


9.4 Heine—Borel Theorem 


Te 
8. 


Use the Approximation Theorem for Infima. 
You may use Exercise 4.4.7. 


9.5 Limits of Sequences 


2: 
6. 


8. 


a) (0, 2). b)(O, 1, 1). c) (0, 1, —1/2). 

b) Show that a set C is relatively close in E if and only if E\C is relatively 
open in E 

b) Use the Bolzano—Weierstrass Theorem. 


9.6 Limits of Functions 


de 


7. 


a) Dom f = {(x,y) : y # 1} and the limit is (0,0). b) Domf = 
{(x,y) :x £0, y £0, andx/y # km/2, k odd} and the limit is (@, 3, 0). 
c) Dom f = {(x, y) : (x, y) € (0,0), (-1, 1) and (1, —1)} and the limit is 
(0,0). d) Dom f = {(x, y) : (x, y) # (-1, I) and (—1, —1)} and the limit 
is (-1, 0). 


. a) limy+0 limy+o f(x, y) = limy+o limy.9 f(x, y) = 1/2, but f(x, y) 


has no limit as (x,y) — (0,0). b) limyo limy+o f(x,y) = 1/3, 
lim,,9 limy.9 f(x,y) = 1, so f(x, y) has no limit as (x, y) > (0,0). 
c) limy_+o limy+o0 f(, y) = limy+o limy_+0 f@, y) =0, and f(x, y) > 
0 as (x, y) > (0, 0). 

Use the Mean Value Theorem. 


9.7 Continuous Functions 


. For the converse, suppose not, and use the Sequential Characterization 


of Continuity. 


. See Exercise 9.1.8b. 
. See Theorem 3.40. 
. a) A polygonal path in E can be described as the image of a continuous 


function f : [0,1] — E. Use this to prove that every polygonal path is 
connected. c) Prove that if E is not polygonally connected, then there 
are nonempty open sets U,V Cc EsuchthatUNV =@andUUV=E. 


9.8 Applications 


‘on 


. See Exercise 7.2.7. 
. a)of(t) = 1 for allt. b) w(t) = Oift 4 0 and ws (0) = 1. c) wf (t) = Oif 


t £Oand w¢(0) =2. 


. a) /1/3. b) (e — 1) f (0). c) 2/5. d) (e4 — 1)/2e?. 


. d) You may wish to use Theorem 4.18. 
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CHAPTER 10 
10.1 Introduction 


8 


10 


. c) Observe that f,(x) = x” does not converge to a continuous function 
on [0, 1]. 

. a) Show that if EF is not bounded, then there exist x, € E anda € X 
such that p(x, a) > coasn > oo. 


10.2 Interior, Closure, and Boundary 


1. 


2: 


\Oo © 


Veet Ea BU Gr = Bb) b= Fb = [an (1s 
n €N}U {0}. c) E° =R, E =R, JE = 9. d) See Theorem 1.18. 

a) Closed. E? = {(x,y) :2x?+6y? <3} and dE = {(x, y) :2x?+ 
6y” = 3}. b) Closed. E? = @ and E = E. c) Neither open nor closed. 
Bata) i SOx, O<y<2},E={(,y) -y > 8x2, 0<y <2}, 
and dE = {(x,y) :y=8x?, O<y <2} U {((@,2) :-1/2 <x < 1/2}. 
d) Open. E= {(, y) rx? —y?<4,-2<y <2} and dE = {(x, y): 
y—yPa4,-2<y<2} U {(,2) : VB <x < vBI U | (x, -2): 


—V8<x< J8 r 
. Notice that a € E*° and E* is open. 
. See the description of relative balls following Definition 10.49. 
. To show that f is continuous at a, consider the open interval J= (f(a)—e, 


f(a) +68). 


10.3 Compact Sets 


1 


1. a) Compact. b) Compact. c) Not compact. H = EU{(0, y): -Il<y<1}. 
d) Not compact. There is no compact set H which contains E. 

5. Compare with Exercise 10.2.10. 

8. a) Notice that if NH; = @, then {X \ Hx} covers X. 

0. a) Let x, € E. Does E contain a point a such that each B,(a), r > 0, 
contains x; for infinitely many k’s? b) See Exercise 10.1.10a. 


10.4 Connected Sets 


1 


1 


9. Use Exercise 10.4.8. 

0. a) A polygonal path in E can be described as the image of a continuous 
function f : [0,1] > E. Use this to prove that every polygonal path is 
connected. c) Prove that if E is not polygonally connected, then there 
are nonempty open sets U, V Cc EsuchthatUNV =#andUUV = E. 

1. Try a proof by contradiction. 


10.5 Limits of Functions 


1 
9 


. a)R. b) [a,b]. c) &. d) {x} if E is infinite, 0 if E is finite. e) 9. 
. b) See the proof of Theorem 3.26. 


10.6 Continuous Functions 


2 


. b) Note that f~!(—1, 1) is not open. Does this contradict Theorem 
10.58? 
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4. You may wish to prove that A is relatively closed in E if and only if E\ A 
is relatively open in E. 

8. See Theorem 3.40. 

9. See Remark 1.39. 


10.7 Stone—Weierstrass Theorem 
4. If P is a polynomial, what is rs f (x) P(x) dx? 
7. If P is a trigonometric polynomial, what is 1 f(x) P(x) dx? 


CHAPTER 11 


11.1 Partial Derivatives and Partial Integrals 
1. a) fry = fox = 2ye*. b) fry = fyx = Cos (xy) —xy sin (xy). C) fry = fx = 
4y/ (1 —2y?)’. 
2. a) fe = (3x8 + 6x5y3 — 3x2y%) / (x3 + y3)? for (x, y) € (0,0) and 
f: (0,0) = 0. f; is continuous everywhere on R?. b) fe = (2x/3)- 
(2x? + Sy) / (x? + as for (x, y) (0,0) and f, (0,0) = 0. fy is con- 
tinuous everywhere on R?. 
. a)e!. b) (e — 3) sin(a — 1). c) 2/3. 
. a) 2. b) e*.c) 4. 
_a) (16 = 49/8) /21. b) e~7 /2. 
10. c) Choose 6 > O such that |¢(t)| < e for 0 < t < 6 and break the integral 


in part b) into two pieces, one corresponding to 0 < t < 6 and the other 
to 6 <t < oo. d) Combine part b) with Theorem 11.8. 


11. a) L{te} = 1/(s +2)*. b) L{tcosmt} = (s* 2?) /(s? + mn)”, 
c) £{r? sint} = 2 (3s? — 1) /(s2 +1)°. 
11.2 The Definition of Differentiability 


NDMN 


2. When f is differentiable at a and its domain is one dimensional, does 
||f(a + h) — f(a)||/|h| have a limit as h > 0? 
9. b) The function f might not be differentiable when a = 1. 


11.3 Derivatives, Differentials, and Tangent Planes 


1. a) D(f+g)(x,y)=[4x+2 3y*—4] 
and 
D(f - 8), y) = [12x + 2y° — l6xy xy? — 8x? — 16y"]. 
b) D(f +g) (x,y) =[cosx+3y 3x +ysiny —cosy] 
and 
D(f -g)(, y) = [By sinx — 3y? cos y+ 3xy cos x3xy* sin y — 6xy cos y 


+3.x sin x]. 


1 
c) D(f +g), y)= peers ab ee a 


and 


D(f-g)@, y= [y? cos (xy) +y Inx+y sin (xy) +xy cos (xy) +x Inx] ; 
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1 1 
DEHN D=[ye 41 = mal 


and 


D(f + 8) (x,y, 2) =[2xyztyz?—y—x—2y—z 447 Z4x27x7y + 2xyz— yl]. 
. a) 2x —6y+z=-—5.b)2x —2y—z=4.c)z-we=l. 
(10172 1/0), x ey a ST 
. a)x —z=0. b) There are none. 
. a) dz=8x dx + 18y dy. b) dz=—ysin (xy) dx—x sin (xy) dy. c) dz = 4y 
(—x? fees 4) / GF a ye 4)” dx+4x Ge Sys 4) / Ge SS 4) dy 
9. dw = .68 and Aw © 0.703684. 
10. L must be measured with no more than 3% error. 


11.4 The Chain Rule 


okhWN 


1 ox dF dy OF 0z dw OF dx dF dy dF dz 
- dw/dp = OF /dx—-+ ‘ = 
dp oyop oz ap aq dx dq dydq daz dq 


a-w 0 (OF\ dx OF ax | a (AF \ dy 

ap? ~ dp G ap” Ox Op) | dp (5) ap 
OF a*y a (dF\ dz OF 82z 
dy dp2 sap (=) dp dz dp? 


_ OF a°x 2 OF d*y n OF 82z 
~ ax dp2 dy dp? Az Ap? 


OF () _vF (2) + °F i 
ax2 \ap) ° dy? \ap dz2 \dp 
ae oF a 
5) a ax dy 42 0 Ox 0z 
axdy \dp ap dxdz \dp ap 
a-F (a y 
+2 ee 
dydz \ dp ap 
6. Notice by Exercise 11.2.11 that this result still holds if “f is in C?” is 
replaced by “the first-order partial derivatives of f are differentiable.” 


9. Take the derivative of F(x, f(x)) = 0 with respect to x. 
10. Compute the derivative of f - f using the Dot Product Rule. 


’ 


11.5 The Mean Value Theorem and Taylor’s Formula 


Tea) fOoy =H 14 OF DH OE) FOr Hf? Oe hoe) + 


1 1? 1)? -13 
(yt. b) Vet yya24 544 -SP G+ Ses 
5(y=))* 


aq for some (c,d) € L((x,y);(1, 1). ¢) e&% =ltx-y+y 

cad 

(x? —xy+ y?) ae 4 (x3 —x?y +xy? — y?) ee (x4 By + x2y? 
xy? + y*) for some (c,d) € L(x, y); (0, 0)). 

2. Notice that by Exercise 11.2.11, this result still holds if “f is in C?” is 


replaced by “the (p — 1)-st order partial derivatives of f are differen- 
tiable.” 
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10. Apply Taylor’s Formula to f(a +x,b+ y) for p = 3,x =rcosdé, and 
y =rsin@, and prove that 


A 20 
mr 


= fix(a,b) — fyy(a, b) + Fr), 


f(a+rcosé,b+~,rsin@) cos(26) dd 


where F(r) is a function which converges to 0 asr > 0. 

11. c) Let (x2, 2) be the point identified in part b), and observe by one- 
dimensional theory that u,(x2, t2) = 0. Use this observation and Tay- 
lor’s Formula to obtain the contradiction w, (x2, t2) — w;(x2, t2) > 0. 


11.6 The Inverse Function Theorem 


1. a) Since f(x, y) = (a, b) always has a solution by Cramer’s Rule and 
Df is constant, 


2/5 1/5 


1 = 
Dt" ca.) = [303 15 


(see Theorem C.7). 
b) Since f( 459", GED) — g(2km, 2kx) = 0,0), 


bf! (0, 1) = E or : ai (see Theorem C.7) 


depending on which branch of f~! you choose. 
c) Since f (+2, +3) = f (+3, +2) = (12, 13), 


_ [43/10 1/5 +1/5 +3/10 
a aa | as Sieh one ad 


(see Theorem C.7) 


depending on which branch of f~! you choose. 
d) Since f (0, 1) = (0, —1), one branch of f~! satisfies 


0 


1 _ |t/e 
Df o-v=[") : 


(see Theorem C.7). 


4. F (xo, yo, uo, vo) = (0,0), x4 A yo, and uo # O # vo, where F(x, y, 
u,v) = (xu? + yu? — 3xy — 5, xu? + yu? + 3xy — 11). 


6. a) f—!(s,t) = ((s + Vs? — 41) /2, (s — Vs? — 4t)/2). 


b) pet hetes. = [I be Bey (see Theorem C.1). 


676 Answers and Hints to Selected Exercises 


7 
9 
11 


. Find the partial of F(x),..., gj+1,..-, Xn) with respect to x;. 


. Use the Implicit Function Theorem. 
. a) ax + by = 1, where a* +b? = 1.b)x+y—z=+H1. 


11.7 Optimization 


1 


8. 
10. 


CHAPT 
12.1 Jo 
1. 


. a) f 1/2, 3/4) = 7/16 is a local minimum and (—1/3, 1/3) is a sad- 
dle point. b) Let j,k € Z f(ja,kma) = 2. is a local maxi- 
mum if k and j are even, f (jaz,ka) = —2 is a local minimum if 
k and j are odd, and (jz,kz) is a saddle point if 7 + k is odd. 
c) This function has no local extrema. d) f(0,0) = 0 is a local min- 
imum if a > 0 and b* — 4ac < 0, a local maximum if a < 0 and 
b? — 4ac < 0, and (0, 0) is a saddle point if b? —4ac > 0. 

. a) f(2,0) = 8 is the maximum and f(—4/5, +/21/5) = —9/5 is the 
minimum. b) f(1, 2) = 17 is the maximum and f (1, 0) = 1 is the mini- 
mum. c) f(1, 1) = f(—1, —1) = 3 is the maximum and f(—1, 1) = —5 
is the minimum. 

. a) f(—2,0) = —2 is the minimum and f(1/2,+/15/2) = 17/4 
is the maximum. b) f(+2//5,+1//5) = 0 is the minimum and 
f(t1/V5, 2/4/5) = 5 is the maximum. c) A = xy, 3u = x+y, 
f(+1/V2, £1/V2, 0) = —1/2 is the minimum and f(+1/V/6, +1/V6, 
+2//6) = 1/6 is the maximum. d) f(1, —2,0, 1) = 2 is the minimum 
and f(1, 2, —1, —2) = 3 is the maximum. 

. b) If DE < 0, then ax + by + cz has no extremum subject to the con- 
straint z= Dx? + Ey’. 
b) f(2, 2, 4) = 48 is the minimum. There is no maximum. 
a) Use Cramer’s Rule. b) Apply Theorem 11.59. 

ER 12 

rdan Regions 


a) V(E; G1) = 3/4, V(E; G2) = 7/16, V(E; G3) = 15/64; v(E; Gm) 
for all m. B) V(E; Gi) = 1, v(E; Gi) = 0; V(E; Go) = 13/16, v(E; Go) 
V(E; G3) = 43/64, v(E; G3) = 5/32. y) VE; G1) = 1, v(E; G1) 
V(E; Go) = 1, v(E; Go) = 1/4; V(E; G3) = 15/16, v(E; G3) = 1/2. 


0 
0; 
0; 


. c) First prove that E is a Jordan region if and only if there exist grids G,, 


such that V(0E; Gn) > 0asm — oo. 


. a) See Theorem 8.15 or 10.30. b) You may use Exercise 12.1.6a. 
. d) Apply part c) to E; = (EF, \ Ez) U Ey. e) Apply parts c) and d) to 


(E) U Eo) = (Ey \ (E19 E2)) U (Eo \ (E19 E2)) U (2,9 £2). 


. Is it true for rectangles? 
. ais acluster point of E if and only if B,(a) N E contains infinitely many 


points for every r > 0 (see Exercise 2.4.8). 


12.2 Riemann Integration on Jordan Regions 
2. a) 5. 


3 


. Show that the difference converges to zero as r > 0+. 
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4. b) Area(E). 
7. a) —1.b) 1/2. 

10. a) Let e > 0 and choose 6 by uniform continuity of ¢. Choose a grid G 
such that U(f, G) — L(f, G) < 6”. Then break U(g 0 f, G) — L(go f,G) 
into two pieces: those j which satisfy M;(¢ 0 f) —mj(¢o f) < 6, and 
those j which satisfy M;(¢ 0 f) —mj;(¢o f) = 6. These two pieces are 
small for different reasons. 

b) Use Example 3.34 and Theorem 12.29. 


12.3 Iterated Integrals 


1. a) 1/4. b) 135/16. c) 2/2 — sin(227)/4z. 

2. a) E = {(x,y) :0< x < 1,6x < y < 4x? +9} and the integral of 
2x +3 over E is 29. b) E = {(x,y) :0 <x < 1,0 < y < x} and 
the integral of cos(x*) over E is sin(1)/2. c) E = {(x, y,z) :0<y< 
Lig Sa = ya eS 3x? + 2y} and the volume of E is —17/60. d) 
E={(,y,z) :0<y<ly<x<ll<z a oa and the integral of 
Jx? + 2z over E is —2(3./3 — 2/2)/15. 

3. a) (log 2)/6. b) 4. c) (e? — 3)2. d) 1/4. 

4. a) 3. b) 91/30. c) 88/105. d) 1/18. 

7. a) See Exercise 12.3.6. 


12.4 Change of Variables 


1. a) (1 — cos4)/4. b) 3/10. c) (/2 + log(1 + V2))(b3 — a3)/6. (Recall 
that the indefinite integral of sec 6 is [ sec = log|sec6 + tan6| + C.) 

2. a) (1 /3/3) sin3. b) 167/(3 - 5-7). 

3. a) (6V6 — 7)4m/5. b) 1 (4e3 — 1 — 2(V8 — Le). c) 16/2/15. 

4. b) ar*d/a. 

5. a) 4/27. b) 9/112. c) 3(e — 1)/e. d) 5. (Use the change of variables 
x=u+v,y=u-—v.) 

6. See Exercise 12.2.3. 

9. See Exercise 8.2.7. 

10. d) x”/?. 


12.5 Partitions of Unity 
3. See Theorem 7.56. 
12.6 The Gamma Function and Volume 


5. Let % represent the spherical change of variables in R” and observe 
that the cofactor |A;| of —psing, sing2...sing,_2sin@ in the matrix 
Dw is identical to Ay,_, if, in Ay,_,, 9 is replaced by g,—2 and each 
entry in the last row of Dy,_ is multiplied by sind. 

8. r? Vol(B,(0))/(n + 2). 
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CHAPTER 13 
13.1 Curves 


5. 


OND 


11. 


a) This curve spirals up the cone x7 + y? = 2* from (1,0, 1) to 
(—e?", 0, e?") and has arc length 3(e77 = 1)/2. b) This curve coincides 
with the graph x = +2y*/*/3,0 < y < 1 (looking like a stylized gull in 
flight) and has arc length (8/2 — 4)/3. c) This curve is a straight line seg- 
ment from (0, 0, 0) to (1, 1, 1) and has arc length V3. d) The arc length 
of the asteroid is 6. 


_ a) 3125/4. b) —ab(a? + ab + b?)/3(a +b). c) 36m. d) GY2+4+5v6) 
. b) Use Dini’s Theorem. 
. Analyze what happens to (x, y) and dy/dx := (dy/dt)/(dx/dt) as 


t > -—o0o,t ~ -l-,t — -t+,t — 0, andt — oo. For example, 
prove that, ast — —1-, the trace of (tf) lies in the fourth quadrant and 
is asymptotic to the line y = —x. 

b) Take the derivative of v’ - v’ using the Dot Product Rule. d) Observe 
that $(t) = v(€(t)) and use the Chain Rule to compute @'(t) and $’(r). 
Then calculate ¢’ x $” directly. 


13.2 Oriented Curves 


1. 


Go 


chs 


8. 


a) A spiral on the elliptic cylinder y* + 9z* = 9 oriented clockwise when 
viewed from far out the x-axis. b) A cubical parabola (it looks like a 
stylized gull in flight) on the plane z = x oriented from left to right when 
viewed from far out the plane y = x. c) A sine wave on the parabolic 
cylinder y = x* oriented from right to left when viewed from far out 
the y-axis. d) An ellipse sliced by the plane x = z out of the cylinder 
y? + 27 = 1 oriented clockwise when viewed from far out the x-axis. 
e) A sine wave traced vertically on the plane y = x oriented from below 
to above when viewed from far out the x-axis. 


. a) 128/3. b) —1-V2/2. c) 0. 
. a)5.b) x(-1+ V5)/2. c) |R|(2 — a — b)/2. d) —sin(1) + 1/3. 
. c) There exist functions w and r on [0, 1] which are C! on (0, 1) \ {j/N : 


j =1,...,N}such that t’ > 0 and w = $; ot on ((j — 1)/N, j/N) for 
each j = 1,...,N. 

c) If F is conservative, consider the case when C is smooth first. If (*) 
holds, use parts a) and b) to prove that F is conservative. 

Use Jensen’s Inequality. 


13.3 Surfaces 


1. 
2. 


a) /2m(b? — a”). b) 4a. c) 47 ab. 

a) d(u,v) = (u,v,u2 — v2), E = {tu,v):—-l1 <u <1, -|lu|l < v < 
jul}, Wit) = U,t1—27), WO = 141-1), ¥30 = @t,0), 
vit) = (¢,-00, h = b = & = iy = [-1,1], and ffpgdo = 
22/3. b) d(u,v) = (u,u?,v), E = [0,2] x [0,4], Wi) = (0°, 4), 
¥o(t) = 6,0), ¥30 = 0,09, WO = 2,89, 1 = b = [0,2], 
Iz = I, = [0,4], and f{,gdo = (4/27)(1457? — 1). c) du,v) = 


5: 
6. 
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(3cosucosv,3sinucosv,3sinv), E = [0,27] x [7/4,7/2], Wi) = 
((3/V2) cost, (3/2) sint, 3/2), W(t) = (cost, 3sint, 0), 7 = [0, 27], 
and SIs gdo = 2717/2. 
b) Use Theorem 12.65. 
If you got 527, you gave up too much when you replaced ||(x, y)|| by 3. 


13.4 Oriented Surfaces 


1. 


2. 
3. 
4. 


a) Since the x-axis lies to the left of the yz-plane when viewed from far 
out the positive y-axis, the boundary can be parametrized by ¢(t) = 
(3sint, 0, 3cost), 1 = [0,27], and f,, F-Tds = —9x. b) The boundary 
can be parametrized by @,(t) = (0, -t, 1 + 21), , = [-1/2, 0]; @o(t) = 
(t,0,1—1t), bb = [0, 1]; and @3(t) = (-t, 1 + 4)/2, 0), 3 = [—1, 0]; and 
Sys -Tds = —1/12. c) The boundary can be parametrized by ¢,(t) = 
(2sint, 2cost, 4), 1; = [0, 27], and @,(t) = (cost, sint, 1), Jy = [0, 27], 
and f, .F-Tds = 32. 

a) 1/2. b) 16. c) 20ab?. d) 1/8. 

a) —14/15. b) 47ra3/3. c) (3b4 + 8a? — 8(a* — b*)3/*) 1/12. d) —21°/3. 

b) Use Theorem 12.65. 


13.5 Theorems of Green and Gauss 


1 


1 


a) 8/3. b) 3log3 + 2(1 — e3). c) —157/4. 


2. a) (b—a)(c — d)(c +d — 2)/2. b) —1/6. c) 0. 
3. a) 2(5+ 3). b) x. c) 8. d) mabc(a +b + c)/2. 
4. 
5 
9 
0 


a) 224/3. b) 2(8./2 — 7)/15. c) 24z. 


. b) 3/2. c) Vol(E) = (1/3) f,, x dy dz + ydzdx + zdx dy. d) 2x7 ab’. 
. c) Use Exercise 13.5.8 and Gauss’s Theorem. 
. €) Use Green’s Theorem and Exercise 12.2.3. 


13.6 Stokes’s Theorem 


. a) —7/4. b) 2777/4. 
_ a) 0. b) —3. c) —10m. d) —1/12. 
. a) 27/5. b) —1/(8V/2). c) 28m (not —28m because i x k = —j). d) 327. 


e) —7. 


_ a) 1877. b) 87. c) 3(1 — e) + 37/2. d) 0. 
. b) 2z. 


CHAPTER 14 
14.1 Introduction 


dL 


a) ag(x?) = 27/3, ax (x*) = 4(—1)*/k?, and by(x*) =0 for k=1,2,.... 
b) All Fourier coefficients of cos” x are zero except ay(cos* x) = 1/2 and 
an(cos? x) = 1/2. 


. aba(f) =Ofork =0,1,..., be(f) = 4/(k2) when k is odd and 0 when 


k is even. 
c) You may wish to use Theorem 9.60. 


680 Answers and Hints to Selected Exercises 


14.2 Summability of Fourier Series 


5. b) See Exercise 10.7.6d. c) See Exercise 5.1.4b. 
7. b) See Exercise 10.7.6d. 
8. See Theorem 9.69. 


14.3 Growth of Fourier Coefficients 
4. See Exercise 14.2.4a and Theorem 7.12. 
14.4 Convergence of Fourier Series 


1. Note: It is not assumed that f is periodic. 

2. c) 27/8. 

4. a) Use Abel’s Formula. For the first identity, you must show that p% Sy 
— OasN —> ooforallp € (0, lif Sar a,r* converges for allr € (0, 1). 

5. a) Prove that for each fixedh, ay(f(x+h)) = ag(f) coskh+b;,(f) sinkh. 


Subject Index 


Abel summable, 261(6) 
Abel’s Formula, 223 
Abel’s Test, 227(4) 
Abel’s Theorem, 254 
Absolute curvature, 549(10) 
Absolutely convergent series, 212, 
213, 245 
rearrangements of, 216 
Absolutely integrable, 181 
Absolute value, 24 
Accuracy (see Rate of 
Approximation) 
Additive identity, 20 
Additive inverse, 20 
Additive Property, 21 
Algebra of sets, 16, 46 
Algebraic number, 54(7) 
Almost everywhere continuous, 346 
Almost everywhere convergent, 
611(8) 
Alternating Series Test, 224, 230 
Analytic continuation, 273 
Analytic function, 263 
Angle between vectors, 286 
Approximation Property, 31, 36 
Arc, 539 
(See also Curve) 
Arc length, 540, 545 
Arc length differential, 544 
Archimedean Principle, 32 
Archimedes’ approximation to 
12 
Area, 147, 148, 468 
of a parallelogram, 300(7) 
of a rectangle, 463, 469 
surface, 564, 567 
(See also Volume) 
Arithmetic mean, 29 
Arithmetic-geometric mean, 72 
Associative Properties, 19 
Astroid, 539 


Axiom(s), Completeness, 32 
Field, 19 
of Induction, 37 
Order, 20 
Well-Ordering, 37, 638 


Balls, in a metric space, 358 
relative, 382 
in R”, 302 
volume of, 530 
Basis, usual, 285 
Bernoulli’s Inequality, 128 
Bernstein’s Theorem, 271, 625(5) 
Bessel functions, 250(8) 
Bessel’s Inequality, 612 
Bijection, 43 
Binomial coefficients, 39, 269 
Binomial Formula, 40 
Binomial Series, 270 
Bolzano—Weierstrass Property, 376, 
386 
Bolzano—Weierstrass Theorem, 70, 
326 
Borel Covering Lemma, 319 
Bound, lower, 34 
upper, 30 
Boundary, manifold, 566 
of a set, 313, 367 
of a surface, 566 
topological, 566 
Boundary point, 566 
Bounded function, 99 
Bounded sequence, 58, 324, 360 
above, below, 58 
uniformly, 243(3) 
Bounded set, 34 
above, 30 
below, 34 
interval, 27 
Bounded variation, function 
of, 184 
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Cantor set, 354(9) 
Cantor—Lebesgue Lemma, 630-631 
Cantor’s Diagonalization Argument, 
50 
Cantor’s Intersection Theorem, 376 
Cantor’s Uniqueness Theorem, 631 
Cartesian product, 16, 52, 397 
Cauchy Criterion 
for absolute convergence, 212-213 
for sequences, 72-73, 324, 326, 360 
for series, 202 
for uniform convergence, 241 
Cauchy—Riemann equations, 429 
Cauchy-Schwarz Inequality, 287 
Cauchy sequence, 72, 324, 360 
Cauchy’s example, 264 
Cauchy’s Mean Value Theorem, 125 
Cauchy’s Theorem, 73 
Cesaro means, 606 
Cesaro summability, 205(10), 606 
Chain Rule, 120, 426-427 
Change of variables, on an interval, 
170, 172-173 
on a Jordan region, 509, 527 
on an open set, 525-526 
Chord, 112, 190 
Circular helix, 552 
Clopen, 316, 381 
Closed ball, 302, 358 
Closed curve, 538 
Closed form, of the Dirichlet kernel, 
602-603 
of a power series, 259 
Closed Graph Theorem, 352 
Closed interval, 26-27 
Closed set, 302-304, 358 
limit characterization, 327, 360-361 
Closed surface, 566 
Closure of a set, 311, 366 
Closure Properties, 19, 22 
Cluster point, 75, 382 
Codomain, 16 
Commutation 
(See Interchange the order of) 
Commutative Properties, 19 
Compact set, 317-318, 321, 370 
sequentially compact, 75, 319, 363 


Compact support, 517 
Comparison Test, 208-209 
Comparison Theorem, for functions, 
87-88, 385 
for improper integrals, 179-180 
for integrals, 159 
for multiple integrals, 485 
for sequences, 64, 66(7b) 
for series, 208 
Complement, 16 
Complete Archimedean ordered field, 
22 
Complete metric space, 362 
Completeness Axiom, 32 
Completeness Property for Infima, 36 
Completeness of the trigonometric 
system, 610 
Component, of a function, 328 
of a vector, 281 
Composition, 98 
continuity of, 98, 385 
Concave function, 189 
Conditionally convergent series, 212 
rearrangements of, 219 
Conditionally integrable, 181 
Cone, 651 
parametrization of, 561 
Connected set, 307, 377 
characterization in R, 308, 378 
polygonally connected, 344, 381 
Conservative vector field, 557(7) 
Constraints, extrema with, 455, 457 
Continuity, 97, 338, 385 
characterization 
by open sets, 339-340, 343(3), 387 
by sequences, 97, 338, 385 
uniform, 106, 338, 374 
Continuous, function, 97, 338, 385 
image of a compact set, 340-341, 
387-388 
image of a connected set, 340-341, 
387-388 
integrability, 148, 480 
inverse image, 339-340, 343(3,4,5), 
387 
on a metric space, 385 
on R, 97 


on R”, 338-339 
uniformly, 106, 338-339, 374 
Continuously differentiable, 
116, 413 
Continuously extended, 108 
Contradiction, 23 
Convergence, absolute, 212, 245 
almost everywhere, 611(8), 626 
conditional, 212 
of a function, 82, 91, 330, 384 
interval of, 253 
in a metric space, 360, 383 
pointwise, 236, 245 
radius of, 251 
of sequences, 55, 324, 360 
of series, 198 
uniform, 238, 245, 403 
(See also Limit) 
Convergence Question, 601 
Convex functions, 189 
continuity, 192 
differentiability, 195 
Convex set, 432 
Coordinate function, 328 
Coordinate hyperplane, 496 
Coordinates, 281 
Corona, 566 
cos x, 122(9), 638 
Taylor series of, 265 
Countable set, 50 
at most countable, 50 
Countable subcovering, 317, 370 
Covering, cover, 317, 321, 370 
Cramer’s Rule, 650 
Cross product, 289, 426 
Cube with side s, 397 
Curl, 582, 587(9) 
Curvature, 549(10) 
Curve, 538, 545 
arc length of, 540, 545 
closed, 538 
explicit, 539 
parametrization of, 538, 544 
piecewise smooth, 545 
rectifiable, 546 
simple, 543, 545 
smooth, 543 
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Cylinder, parametrization of, 559 
Cylindrical coordinates, 511 


Darboux’s Theorem, 129 
Decimal expansion, 66, 199 
Decreasing function, 125 
Decreasing sequence, 67, 345 
Deductive proof, 23 
Degenerate interval, 27 
DeMorgan’s Laws, 47 
Dense set, 344, 391(8) 
Density, of irrationals, 36 
of rationals, 33 
Derivative, 112, 116, 410 
curl, 582, 587(8) 
definition in R”, 410 
directional, 416, 430(11) 
divergence, 582, 587(8) 
of higher order, 113, 433-434 
of an integral, 166, 402, 405 
partial, 397, 537 
of a real-valued function, 112, 116 
second symmetric, 626 
of a series, 245, 255-256 
total, 410 
of a vector-valued function, 410, 537 
(See also Differentiable function) 
Determinant, 647 
Diagonalization Argument, 50 
Differentiable function, 112, 116, 409 
at a point in R, 112 
at a point in R”, 409 
on a set, 116, 537 
(See also Derivative) 
Differential, Leibnizian, 169 
total, 421, 433-434 
use for approximation, 421—422 
Differential form (See Form) 
Differentiating under an integral sign, 
403, 405 
Differentiation of an integral, 166 
Differentiation of a series, 245, 
255-256 
Dilation, 476(7) 
Dimension, 281 
Dini’s Theorem, 345 
Directional derivative, 416, 430(11) 
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Direct method, 456 
Dirichlet function, 102, 149 
Dirichlet kernel, 601 
Dirichlet-Jordan Theorem, 623 
Dirichlet’s Test, 224 
for uniform convergence, 246 
Discontinuous, 101 
Discrete metric, 357 
Discriminant for local extrema, 454 
Disjoint, 470 
Distance, 27, 284, 300(8), 324(7) 
Distributive Law, 19 
Divergence of a function, 582, 587(8) 
Divergence Test, 200 
Divergence Theorem, 583 
Divergent sequence, 62 
Divergent series, 199 
Domain, of a real-valued function, 16 
of a relation, 16 
maximal domain, 328 
Dominated, 99, 332 
Dot product, 282 
Double angle formulas, 641 
Double integral (See Iterated integral) 
Double series, 248 
Dyadic rational, 37 


e, 128, 266, 277-278 
Element (of a set), 15 
Ellipsoid, 651 
Empty set, 15 
Empty sum, 464 
Endpoints, 27, 538 
Equality 
of functions, 16 
of ordered pairs, 16 
of rationals, 21 
of sets, 16 
of vectors, 281 
Equivalence relation, 658-659 
Equivalent, orientation, 550, 571 
smoothly, 544, 545, 564, 567 
Estimation, 228 
Euclidean distance, 284 
Euclidean norm, 283 
Euclidean space, 281 
Euclid’s Theorem, 275 


Euler’s proof, 275 
Euler’s Theorem, 277-278 
Exact function, 596(8) 
Explicit curve, 539 
Explicit surface, 559 
Exponential function, 17, 105(11), 
176(8) 
Taylor series, 265-266 
Extended real number, 35 
Comparison Test, 210 
Comparison Theorem, 66(7), 96(4) 
limits, 62, 92 
Sequential Characterization, 94 
Squeeze Theorem, 66(7), 96(4) 
Extrema with constraints, 455, 457 
Extreme Value Theorem, 99, 342, 389 
Extremum, local, 449 


Factorial, 39 

via Gamma function, 529 
Fejér kernel, 602 
Fejér’s Theorem, 609 
Field Axioms, 19 
Finer, 144, 463 
Finite difference, 224 
Finite set, 50 
Finite subcovering, 317, 321, 370 
First Mean Value Theorem, 161 
First-order partial, 397 
First partials commute, 399 
Fixed point, 105(4) 
Folium of Descartes, 549(9) 
Form, differentiable 

of degree 1, 554 

of degree 2, 573 
Fourier coefficients, 600 
Fourier series, 600 
Fourier’s Theorem, 600 
Fubini’s Theorem, 491, 504(10) 
Function, 16 

algebra, 391 

analytic, 263 

bounded, 99 

of bounded variation, 184 

component, 328 

convex, 189 

continuous, 97, 338, 385 


coordinate, 328 

differentiable, 112, 116, 409 

domain, 16, 328 

exponential, 17, 105(11), 176(8) 

harmonic, 429(6), 588(10d) 

increasing, 125 

inverse, 43 

limit, 82, 91, 330, 383, 384 

linear, 295 

logarithmic, 17, 176(7) 

monotone, 125 

nowhere continuous, 102 

nowhere differentiable, 278 

periodic, 599 

product of, 86, 331 

real, 17 

real-valued, 17, 328 

Riemann integrable, 148, 478 

support, 517 

sum of, 85, 331 

trigonometric, 122(9), 638 

uniformly continuous, 106, 338, 374 

variation of, 184 

vector-valued, 328 

Fundamental Theorem of Absolute 
Values, 25 
Fundamental Theorem of Calculus, 

166 


Gamma function, 528 

Gauss’s Theorem, 583 

Generalized Binomial Coefficients, 
269 

Generalized Mean Value Theorem, 
125 

Geometric mean, 29 

Geometric series, 201 

Gibbs’s phenomenon, 605(6) 

Global property, 505 

Gradient, 412, 430(11), 588(9) 

Greatest lower bound (See Infimum) 

Green’s Identities, 588 

Green’s Theorem, 579 

Grid, 463 


Hardy’s Tauberian Theorem, 621 
Harmonic function, 429(6), 588(10d) 
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Harmonic series, 200 

Heat equation, 429(7), 437(11) 
Heine—Borel Theorem, 321, 374 
Helix, 552 

Holds for large n, 56 
Homogeneous Rule, 120 
Hyperbolic paraboloid, 651 
Hyperboloid, 651 

Hyperplane, 293, 496 


i, 285 
Image, inverse image, 45 
Implicit Function Theorem, 444 
Implicit Method, 461(8) 
Improper integral, 178 
absolutely integrable, 181 
Comparison Theorem, 179 
conditionally integrable, 181 
convergence, 178 
Fubini’s Theorem, 504 
uniform convergence, 403 
Increasing function, 125 
Increasing sequence, 67 
of functions, 345 
Indeterminate form, 64, 136-137 
Induced orientation, 574 
Induction, 37 
Inductive set, 634 
Inequality, Bernoulli’s, 128 
Bessel’s, 612 
Cauchy—Schwarz, 287 
Jensen’s, 193 
triangle, 25, 288, 356 
Infimum, 34 
Infinite series, 198 
(See also Series) 
Infinity, 35 
Inner product, 282 
Inner sum, 472 
Inner volume, 473 
Integers, 21 
Integrability, 148, 478 
of the absolute value, 159, 485 
of a continuous function, 148, 480 
Lebesgue’s characterization, 349, 
487 
of a monotone function, 154(8) 
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Integrability 
of a product, 160, 489(8) 
of asum, 157, 481 
Integrable, on an interval, 148 
on a Jordan region, 478 
locally, 178, 522 
on an open set, 525 
(See also Improper integral) 
Integral, 150, 478 
differentiation of, 166, 402, 405 
on an interval, 150 
iterated, 490 
on a Jordan region, 478 
on a line (unoriented), 544, 545 
lower, 150, 477 
on an open set, 525 
oriented line, 551, 554 
oriented surface, 571, 574 
on a set of volume zero, 484 
on a surface (unoriented), 564, 567 
upper, 149, 477 
(See also Integrability) 
Integral Test, 207, 228 
Integration by parts, 169 
Interchange the order of, 98 
a derivative and an integral, 402, 
405 
iterated limits, 336 
a limit and a derivative, 242 
a limit and a function, 98 
a limit and an integral, 240, 401, 404 
mixed partial derivatives, 399, 
417(11) 
the order of integration, 491, 
504(10) 
the order of summation, 221(6) 
(See also Term-by-term) 
Interior, of a set, 311, 365 
of a surface, 566 
Intermediate Value Theorem, 101, 
129, 344(9), 390 
Intersect, two sets, 312, 367 
Intersection, of two sets, 16 
of a family of sets, 47 
image of, 47 
Interval, 27 
Interval of Convergence, 253 


Inverse function, 16, 43 

Inverse Function Theorem, 141, 440 
Inverse image, 45 

Invertible matrix, 646 

Irrational number, 21, 277 
Isoperimetric Problem, 623 

Iterated integral, 490 

Iterated limits, 334 


j, 285 
Jacobian, 438, 515(6) 
Jacobian matrix, 411 
Jensen’s Inequality, 193 
Jordan content, 467 
Jordan Curve Theorem, 538 
Jordan region, 467 
characterization, 456, 473 
image of, 472 
projectable, 495 
union of, 469, 476(6) 
of volume zero, 466 


k, 285 
Kernels, 601-602, 611(6) 


Lagrange Multipliers, 457 
Lagrange’s Theorem, 268 
Laplace Transform, 407 
Laplace’s equation, 429(6), 588(10) 
Large n, holds for, 56 
Law of Cosines, 643 
Least element, 37 
Least squares, 461(10) 
Least upper bound (See Supremum) 
Lebesgue’s Theorem, 349, 487 
Left-hand derivative, 194 
Left-hand limit, 91 
Leibnizian differential, 169 
Length, 27, 468, 540 
l’H6pital’s Rule, 134 
Lie between, 101 
Limit, of functions, 82, 91, 330, 383 
iterated, 334 
left-hand, right-hand, 91 
one-sided, 91 
of sequences, 55, 324, 360 
sequential characterization, 84, 94, 
330, 384 


of series, 198 
two-sided, 82, 91 

Limit Comparison Test, 210 

Limit infimum, 76 

Limit supremum, 76, 214 

Lindel6df’s Theorem, 318, 373 

Line, parametric form, 285, 538 

Line integral, 544, 545 
oriented, 551, 554 

Line segment, 286 

Linear function, 114, 295 

Linear Properties, 157, 481 

Linear regression, 461(10) 

Lipschitz class, 625(5) 

Local extrema, 129(8), 449 
with constraints, 455, 457 
discriminant for, 454 
Lagrange Multipliers, 457 

Locally integrable, 178, 522 

Local property, 505 

Logarithm function, 17, 176(7) 

Logarithmic Test, 233 

Lower bound, 34 

Lower integral, 150, 477 

Lower Riemann sum, 145 

Lower sum, 477 


Maclaurin series, 264 
Manifold boundary, 566 
Mathematical Induction, 23, 37 
Matrix, algebra of, 644-645 
inverse, 646, 649 
representation of linear functions, 
297 
representation of vectors, 296 
transpose, 649 
Maximum, of a function, 100, 119(8), 
192 
in R?, 449 
Mean(s), arithmetic, 29 
arithmetic-geometric, 72 
Cesaro, 606 
geometric, 29 
Mean Value Theorem, 125, 431 
for integrals, 161, 162, 486 
Cauchy’s, 125 
Generalized, 125 
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real-valued functions, 125, 430 
vector-valued functions, 431 
Measure zero, 346, 476(8), 487 
Metric space, 356 
Minimum, of a function, 100, 119(8), 
192 
Mixed second partials, 398 
MObius strip, parametrization of, 570 
Modulus of continuity, 619 
Monotone Convergence Theorem, 67, 
71(6) 
Monotone functions, 125 
continuity, 127 
integrability, 154(8) 
Monotone Property, 35 
Monotone sequences, 67, 345 
Multiplication (See Product) 
Multiplicative, 24 
Multiplicative identity, 20 
Multiplicative inverse, 20 
Multiplicative Properties, 21 


Natural logarithm, 17, 176(7) 
Natural numbers, 21 
Natural parametrization, 542, 549(11) 
n-dimensional rectangle, 397, 463 
n-dimensional region, 558 
Negative part, 28(3), 89(7) 
Nested Interval Property, 69 
Nested sets, 69 
Newton’s Theorem, 276 
Nondegenerate interval, 27 
Nonempty set, 16 
Nonnegative number, 21 
Nonoverlapping, 470 
Norm, of a continuous function, 358 

Euclidean, 283 

€'-, 283 

of a linear function, 298 

of a matrix, 298 

operator, 298 

of a partition, 144 

sup-, 284 

of a vector, 283 
Normal vector 
induced, 562 
to a (hyper)plane, 293 
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Normal vector 
to an explicit surface, 420 
to a surface in parametric form, 
562 
unit, 570, 574 
Nowhere continuous function, 102 
Nowhere differentiable function, 
278 
Numbers, algebraic, 54 
irrational, 21 
natural, 21 
rational, 21 
real, 15,19 
transcendental, 54 


One-sided derivative, 116, 194 
One-sided limits, 91 
One-to-one function, 43 
Onto function, 43 
Open ball, 302, 358 
Open covering, 317, 370 
Open interval, 27 
Open set 
in a metric space, 358 
in R”, 304 
Operator norm, 298 
Order Axioms, 20 
Ordered n-tuple, 281 
Ordered pair, 16 
Orientable surface, 571, 574 
Orientation, induced, 574 
of a curve, 550 
positive, 574 
of a surface, 571, 574 
Orientation equivalent, 550, 571 
Oriented integral, line, 551, 554 
surface, 571, 574 
Oriented positively, 574 
Oriented surface (See Orientable 
surface) 
Orthogonal vectors, 286 
Orthogonality of the trigonometric 
system, 600 
Oscillation of a function, 347 
Outer sum, 463 
Outer volume, 473 


Pairwise disjoint sets, 470 
Paraboloid, 651 
Parallel vectors, 286 
Parallelepiped, 300, 516(9) 
Parallelogram, 284, 300(7) 
Parametric equations 
(See Parametrization) 
Parametrization, of curves, 538, 544 
natural, 542, 549(11) 
of surfaces, 558, 567 
trivial, 539, 559 
Parseval’s Identity, 615 
Partial derivative(s), at a point, 397 
commutation of, 399, 417 
first-order, 397 
mixed, 398 
second-order, 398 
on a set, 537 
Partial differential equations 
Cauchy—Riemann equations, 429 
heat equation, 429(7), 437 
Laplace’s equation, 429, 588(10) 
wave equation, 429(4) 
Partial integral, 397 
Partial Jacobian, 444 
Partial sum, 38, 198, 600 
Partial summation, 224 
Partition, 144, 658 
Partition of unity, 520 
Pascal’s triangle, 39 
Periodic function, 599 
Pi (7), 72(10), 620(7) 
Piecewise smooth curve, 545 
Piecewise smooth surface, 566 
integral on, 567, 574 
orientable, 574 
oriented integral, 574 
surface area, 567 
Pigeonhole Principle, 54(6) 
Plane, 293 
distance from a point, 300(8) 
tangent to a surface, 419, 562 
Point in R” , 281 
Point of accumulation (See cluster) 
Point of discontinuity, 101 
Pointwise convergence, 236, 245 
Pointwise increasing, 345 


Pointwise sum, of functions, 85, 330 
Polar coordinates, 509 
Polygonally connected, 344, 381 
Polynomial, 95(3), 337(4) 
Taylor, 132 
trigonometric, 598 
Positive definite, 25, 356 
Positive number(s), 21, 30(11) 
Positive orientation, 574, 583 
Positive part, 28(3), 89(7) 
Postulates, 19, 20, 32 
Power function, 17, 105(11) 
Power Rule, 121—-122(5,7), 168 
Power series, 251 
closed form, 259 
differentiation, 255 
expansion, 264 
integration, 257 
interval of convergence, 253 
product of, 257-259 
radius of convergence, 251 
uniqueness of, 263 
Preimage, 16 
Prime numbers, 275 
Principle 
Archimedean, 32 
of Mathematical Induction, 37 
Well-Ordering, 37 
Product 
Cartesian, 16, 397 
cross, 289 
of determinants, 648 
of functions, 86, 330 
inner product, 282 
of matrices, 644 
of power series, 257 
scalar, 282 
Product of series, 257-259 
Product Rule, 120, 399 
Projectable region, 495 
Projection, 496 
Proof by contradiction, 23 
Proof by induction, 37 
Proper maximum, 192 
Proper subset, 16 
p-Series Test, 208 
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Quadratic form, 453 

Quadric surface, 651 

Quotient, derivative of, 119, 425(6) 
limit of, 61, 86, 330 
pointwise, 86 

Quotient Rule, 120, 425(6) 


Raabe’s Test, 233 
Radius of convergence, 251 
Rate of approximation, 228-231 
Ratio Test, 215, 231 
Rational number, 21 
Real function, 17 
Real number, 15, 19 
Real-valued, 17, 328 
Rearrangement of series, 216, 
219 
Rectangle, 397 
connectivity, 342 
volume of, 463, 469 
Rectifiable curve, 546 
Refinement, 144 
Reflection, 34 
Region, Jordan, 467 
m-dimensional, 558 
projectable, 495 
of types I, II, or II, 495 
Relation, 16, 658 
Relative balls, 382 
Relatively open/closed, 306, 316(4, 
11), 343(5), 377 
Remainder term, 264 
integral form, 268, 436(5) 
Riemann integral 
(See Integrable and Integral) 
Riemann—Lebesgue Lemma, 613 
Riemann sums, 155 
lower, upper, 145 
Riemann’s Theorem, 219, 629 
Right-hand derivative, 194 
Right-hand limit, 91 
Right-hand orientation, 574 
Rolle’s Theorem, 123 
Root Test, 215, 231 
Rotations, 301(9) 
Rotation invariant, 515(7) 
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Saddle point, 450 
Scalar, 281 
Scalar product, 282 

with a function, 86 
Secant line, 112 
Second Derivative Test, 452 
Second formal integral, 628 


Second Mean Value Theorem, 162 


Second symmetric derivative, 626 
Separable, 325, 372 
Separate(s) a set, 307, 377 
Sequence, 55 
bounded, 58, 324, 360 
Cauchy, 72, 324, 360 
convergent, 55, 324, 360 
divergent, 62 
increasing, 67 
monotone, 67 
pointwise convergent, 236 
uniformly bounded, 243(3) 
uniformly convergent, 238 
Sequential Characterization 
of Continuity, 97, 338, 385 
of Limits, 84, 94, 330, 384 


Sequentially compact, 75, 319, 363 


Series, 198 
absolutely convergent, 212, 245 
alternating, 224 
Cauchy Criterion, 202 
conditionally convergent, 212 
convergent, 198 
divergent, 199 
Fourier, 600 
geometric, 201 
power, 251 
pointwise convergent, 245 
product of, 257-259 
rearrangements, 216, 219 
telescopic, 200 
trigonometric, 598 
uniformly convergent, 245 
Set, 15 
bounded, 34 
Cantor, 354(9) 
closed, 302, 359 
compact, 317, 321, 370 
connected, 307, 377 


countable, 50 
empty, 15 
inductive, 634 
open, 302, 359 
sequentially compact, 75, 363 
uncountable, 50 
of volume zero, 466 
Shift Formulas, 641 
Simple closed curve, 538 
Singularities, 572 
sin x, 122(9), 638 
Taylor series of, 265 
Smooth curve, 543 
Smooth surface, 563 
Smoothly equivalent, curves, 544 
surfaces, 564, 567 
Space, compact metric, 371 
complete metric, 362 
connected metric, 377 
of continuous functions, 358 
Euclidean, 281 
metric, 356 
separable, 325, 373 
Space-filling curve, 537 
Sphere, parametrization of, 560 
Spherical coordinates, 512 
in R”, 530 
Squeeze Theorem 
for extended limits, 66(7) 
for functions, 87, 331 
on a metric space, 384 
for sequences, 60, 66(7) 
Stirling’s Formula, 534 
Stokes’s Theorem, 589 
Stone—Weierstrass Theorem, 393 
Straight line in R”, 285 
Strictly increasing, function, 125 
sequence, 67, 345 
Subcovering, 317, 321, 370 
Subordinate, 522 
Subsequence, 57 
Subset, 15 
proper, 15 
Subspace, 357 
Subspace topology, 378 


Sum, 19 

of functions, 85, 329 

of matrices, 644 

of a series, 199 

of two series, 202 
Sum-angle formulas, 641 
Sum Rule, 120, 418 
Summable 

Abel, 261(6) 

Cesaro, 205(10), 606 
Summability kernels, 611(6) 
Summability Question, 606 
Summation by parts, 224 
Sup-norm, 284 
Support, 517 
Supremum, 30 
Surface, 558 

area of, 564 

closed, 566 

explicit, 559 

integral, 564, 567 

quadric, 651 

orientation of, 571, 574 

parametrization of, 558, 567 

piecewise smooth, 567 

smooth, 562, 563 
Surface area, 564, 567 
Surface integral, 564, 567 

oriented, 571, 574 
Surjection (See Onto) 
Symmetric, 25, 356 
Symmetric derivative, second, 626 


Tangent line, 112, 543 
Tangent plane(s), 419, 448, 569(9) 
and differentiability, 424 
Tangent vector, 543 
unit, 550 
Tauber’s Theorem, 620 
Taylor expansions, 264 
remainder term, 264, 268, 436(5) 
Taylor polynomial, 132 
Taylor series (See Taylor expansions) 
Taylor’s Formula, 131, 265, 435, 436 
Telescopes, 146 
Telescopic Series, 200 
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Term-by-term, differentiation, 245, 
255 

integration, 245, 257 
Test, Abel’s, 227(4) 

Alternating, 224, 230 

Comparison, 208 

Dirichlet’s, 224, 246 

Divergence, 200 

Integral, 207, 228 

Limit Comparison, 210 

Logarithmic, 233 

p-Series, 208 

Raabe’s, 233 

Ratio, 215, 231 

Root, 214, 231 

Second Derivative, 452 

Weierstrass M-Test, 246, 403 
Topological boundary, 566 
Torus, parametrization of, 560 
Total derivative, 410 

uniqueness of, 411 
Total differential, 421, 434 
Total variation, 186 
Trace, of a curve, 538 

of a surface, 558 
Transcendental number, 54 
Transition, between two curves, 544 

between two surfaces, 564 
Transitive Property, 20 
Translation, 476(7) 
Transpose of a matrix, 649 
Triangle Inequalities, 25, 288, 356 
Trichotomy Property, 20 
Trigonometric functions, 122(9), 638 
Trigonometric polynomial, 598 
Trigonometric series, 598 
Trivial parametrization, 539, 559 
Twice differentiable, 113 
Two-sided limits, 82, 91 
Type I, I, or II, 495 


Uncountable set, 50 

Uniform Cauchy Criterion, 241 

Uniform convergence, 238, 245, 403 
(See also Term-by-term) 

Uniformly bounded, 243(3) 
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Uniformly continuous, 106, 338, 374 
characterization on intervals, 108 
integrability, 480 

nion, of two sets, 16 

of a family of sets, 46 

image of, 47 

nique, 16 

niqueness 

of identities and inverses, 633 
of power series, 263 

of the total derivative, 411 

of trigonometric series, 631 
niqueness Question, 601 

nit normal vector, 570, 574 
nit tangent vector, 550 

nity, partition of, 520 

pper bound, 30 

pper integral, 149, 477 

pper Riemann sum, 145 

pper sum, 476 

rysohn’s Lemma, 519 

sual basis, 285 

sual metric, 356 
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Vacuous implication, 359 
Value of a series, 199 
Variables, change of, 170, 172, 509, 527 
Variation, bounded, 184 
total, 186 


Vector(s), 281 
angle between, 286 
components, 281 
difference, 282 
equality, 281 
identification with matrices, 296 
parallel, 286 
sum, 282 
Vector function, 328 
continuity, 338 
differentiability, 409 
Volume, 463, 469, 481 
of a ball, 530 
connection with determinants, 
516(9) 
connection with x, 300(7) 
integral form, 481 
of a parallelepiped, 300(7), 516(9) 
of a rectangle, 463 
of volume zero, 466 


Wave equation, 429(4) 
Weierstrass M-Test, 246, 403 
Weierstrass’s Theorem, 278 
Well-Ordering Principle, 37, 638 


Zero, of area, 466 
of volume, 466 
Zero vector, 281 
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Symbol, description page(s) defined 
A the: empty Set ic. Layad holaide ae Re eed OG ed eae 15 
oo, —00, infinity (negative infinity) ......... 2.00... c cece cece eee ene 35 
€, ¢, an element of (not an element Of).......... 0... cece eee eee ene eens 15 
C, C, is a subset of (is a proper subset Of) .......... 0.0 eee eee eee eee eee 15-16 
AUB, ANB, the union (intersection) of A and B ......... cece eee eee eee 16 
A\B, B‘, the complement of B relative to A (to a universal set) ............. 16 
A x B, the Cartesian product of A and B...... eee eee ee 16 
Uren, Uwea, the union of a sequence (a family) of sets................2.20005 46 
NkeN; Nwea, the intersection of a sequence (a family) of sets................. 47 
lim, ++, lim, g_, the right-hand (left-hand) limit.......................04. 91 
sup E, inf E, the supremum (infimum) of a set E............... 0c cence ee 30, 34 
ie 4k, Dope] Ak, a finite (infinite) sum or series..................0000. 38, 198 
N, the set of natural numbers ........... 0... ccc cece eee n nnn eees 21 
Q, the set of rational numbers. ..... 0... ccc ccc cece e eee eens 21 
R, the set of real numbers ......... 0... ccc ccc cen eee n nn aees 15,19 
Zs theiset Of Mt Pers: 202 sired see AG wae end wha pe hoa neds owed one eee 21 
<, <, less than or equal (strictly less than) .. 20, 21, 35 (extended real numbers) 
la|, the absolute value Of € 0... ccc ccc ence eee nnn eet ee nnnnes 24 
a*,a_, the positive (negative) part of anumberd ...............0..0e cece 28 
if Vg, f \g, the maximum and minimum of two functions.................. 90 
(a, b), [a, b], open (closed) interval with endpoints a and b.................. 27 
||, |R|, the length of an interval J (the volume of a rectangle R)........ 27, 463 
Xn t 4, Xn | a, an increasing (decreasing) sequence which converges toa ....67 
lim sup, lim inf, the limit supremum (infimum) ...................000005 76, 214 
e*, the exponential function............. 0.00 cece eee ence eee 17, 105(11), 176(8) 
log x, the natural logarithm of X......... 0... cece ence eee eee eens 17, 176(7) 
RI 0 the remainder term of the Taylor expansion of f at xo............055 264 
sinx, cosx, Sine (COSINE) Of Xo... keene cece eee e teen ee eeee 122(9), 638 
x”, an irrational power Of X12... 0.0 cece cece nee nent nee eees 17, 105(11) 
I(x), the gamma function evaluated at x... 0... cece ccc een e eee 528 
R", n-dimensional Euclidean space............. 0. cece cece eee ence eee ences 281 
6; 1he Usual basis OL RO < casiecacaseieheensy o.oo men wew he ea heey 285 
IIx||, [[XIloo5 [|x/|1, the norm, the sup-norm (the ¢!-norm) of a vector x...283, 284 
|| B||, the operator norm of a matrix (or linear function) .................... 298 
B,(a), the open ball centered at a of radiusr............ 0... cece eee eee 302, 358 
L(a; b), the line segment between aandD............. 0 cee eee eee eee 285-286 
IIn(a), the (hyper)plane with normal n passing througha.................. 293 
E°, E, the interior (closure) of a set E..........0cccceecceeneeeees 311, 365-366 
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dE, 0S, the boundary of a set E (surface S).............0. eee eee 312, 367, 566 
Area(E), Vol(E), the area (volume) of aset FE ........ 0. cece eee eee ee 467-468 
Vol(E), Vol(E), the outer (inner) volume of E............... 6.0 c eee eee 472-473 
L(C), the arc length of a curve C......... cece eee eee eee eens 540-541, 545 
o(S), the surface area of a surface So... ccc cece eens 564, 567 
f :X > Y,afunction from X tO Yo... ccc cence eee neeenes 16 
ft, f-, the positive (negative) part of a function f ...............0..04. 89-90 
f og, the composition of f With g...... 00. ccc cece cece e nee neenees 98 
f—', the inverse function Of f .........c ccc cece ce ccc ee eee e nese een eeees 16, 43 
f(E), f~'(E), the image (inverse image) of E under f ................0005. 45 
f(a‘), f(@), the right-hand (left-hand) limit of f ata ........ 06... cee eee 91 
f', f, the derivative of f (of order k)....... 0... cece cece eee e ees 112, 113, 115 
Drf, Dif, the right-hand (left-hand) derivative of f.................00005 194 
Df, f, the total derivative of f........0. 0... ee 410, 411-413 
VF ,the sradient Of fait. ceestac tan tebdneh basse aad Cine wine a swadcteeenes 412 
D°?) f(a; h), the pth total differential of f ataandh.....................5. 434 
Ag; the. Jacobian Of foie) ced tata dgoc ha ctinie ei0sd sa ahs woah eadee dak ices 438 
curl F, div F, the curl (divergence) of F........... 0.0 c cece cece cence ees 582 
C?, continuously differentiable of order p............ 2... cece eee eee 116, 398 
C™, C& infinitely differentiable (of compact support)............. 116, 398, 517 
||P ||, the norm of a partition Po... ccc eee e nen ees 144 
U(f, P), L(f, P), the upper (lower) Riemann sum of f over a partition P..145 
U(f,G), L(f, G), the upper (lower) sum of f overagridG............ 476-477 
(U) {, (L) f, the upper (lower) Riemann integral ..................... 149, 476 
Var(y), V(y, P), the variation of g (Over P)..... 2... cece eee ene eens 184 
V(E; G), v(E; G), outer (inner) sum of E with respect toG............ 464, 472 
d, Ng, the tangent (normal) vector induced by @ ..................04. 543, 562 
T, n, the unit tangent (normal) vector of a curve (surface) ............ 550, 570 
J, f 4A, fi, f dV, the Riemann integral of f on E C R?(E CR")......... 478 
J ee ds; shedding. integral Of e-OVEr C w2ciyexagawiee Gta viva sear neewed ete 544 
Jc ¥ -Tds, the oriented line integral of F along C..................45. 551, 554 
f eda, the surtace integral Of @°OVETS wicnjevtenes ies Mien hago de 564, 567 
Is g F -ndo, the oriented surface integral of Fon S................-+045 571, 574 
ax(f), bef), Fourier coefficients of f 0... cece cc cence eee ene eens 600 
Dy, Kn, the Dirichlet (Fejér) kernel of ordern..................0.000% 601-602 
(Sy f)(x), the Nth partial sum of the Fourier series of f evaluated at x..... 600 


(on f)(x), the Nth Cesaro mean of the Fourier series of f evaluated at x ... 606 
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